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Abstract 

The dynamic optimality conjecture is perhaps the most fundamental open question about binary 
search trees (BST). It postulates the existence of an asymptotically optimal online BST, i.e. one that is 
constant factor competitive with any BST on any input access sequence. The two main candidates for 
dynamic optimality in the literature are splay trees [Sleator and Tarjan, 1985], and Greedy [Lucas, 1988; 
Munro, 2000; Demaine et al. 2009]. Despite BSTs being among the simplest data structures in computer 
science, and despite extensive effort over the past three decades, the conjecture remains elusive. Dynamic 
optimality is trivial for almost all sequences: the optimum access cost of most length-n sequences is 
0(nlogn), achievable by any balanced BST. Thus, the obvious missing step towards the conjecture is 
an understanding of the “easy” access sequences, and indeed the most fruitful research direction so far 
has been the study of specific sequences, whose “easiness” is captured by a parameter of interest. For 
instance, splay provably achieves the bound of 0{nd) when d roughly measures the distances between 
consecutive accesses (dynamic finger), the average entropy (static optimality), or the delays between 
multiple accesses of an element (working set). The difficulty of proving dynamic optimality is witnessed 
by other highly restricted special cases that remain unresolved; one prominent example is the traversal 
conjecture [Sleator and Tarjan, 1985], which states that preorder sequences (whose optimum is linear) are 
linear-time accessed by splay trees; no online BST is known to satisfy this conjecture. 

In this paper, we prove two different relaxations of the traversal conjecture for Greedy: (i) Greedy 
is almost linear for preorder traversal, (ii) if a linear-time preprocessing is allowed, Greedy is in fact 
linear. These statements are corollaries of our more general results that express the complexity of access 
sequences in terms of a pattern avoidance parameter k. Pattern avoidance is a well-established concept 
in combinatorics, and the classes of input sequences thus dehned are rich, e.g. the fc = 3 case includes 
preorder sequences. For any sequence X with parameter k, our most general result shows that Greedy 
achieves the cost n2“^"^ ' ' where a is the inverse Ackermann function. Furthermore, a broad subclass 
of parameter-fc sequences has a natural combinatorial interpretation as k-decomposable sequences. For 
this class of inputs, we obtain an ^ bound for Greedy when preprocessing is allowed. For A; = 3, 

these results imply (i) and (ii). To our knowledge, these are the first upper bounds for Greedy that 
are not known to hold for any other online BST. To obtain these results we identify an input-revealing 
property of Greedy. Informally, this means that the execution log partially reveals the structure of the 
access sequence. This property facilitates the use of rich technical tools from forbidden submatrix theory. 

Further studying the intrinsic complexity of fc-decomposable sequences, we make several observations. 
First, in order to obtain an offline optimal BST, it is enough to bound Greedy on non-decomposable 
access sequences. Furthermore, we show that the optimal cost for ^-decomposable sequences is 0(n log k), 
which is well below the proven performance of all known BST algorithms. Hence, sequences in this class 
can be seen as a “candidate counterexample” to dynamic optimality. 


‘Work partly done while at Saarland University. 

^The purpose of preprocessing is to bring the data structure into a state of our choice. This state is independent of the 
actual input. This kind of preprocessing is implicit in the Demaine et al. definition of Greedy. 
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1 Introduction 


The binary search tree (BST) model is one of the most fundamental and thoroughly studied data access 
models in computer science. In this model, given a sequence X S [n]™ of keys, we are interested in OPT(X), 
the optimum cost of accessing X by a binary search tree. When the tree does not change between accesses, 
OPT is well understood: both efficient exact algorithms (Knuth [22| ) and a linear time approximation 
(Mehlhorn [^) have long been known. By contrast, in the dynamic BST model (where restructuring of the 
tree is allowed between accesses) our understanding of OPT is very limited. No polynomial time exact or 
constant-approximation algorithms are known for computing OPT in the dynamic BST model. 

Theoretical interest in the dynamic BST model is partly due to the fact that it holds the possibility 
of a meaningful “instance optimality”, i.e. of a BST algorithm that is constant factor competitive (in the 
amortized sense) with any other BST algorithm, even with those that are allowed to see into the future. The 
existence of such a “dynamically optimal” algorithm has been postulated in 1985 by Sleator and Tarjan [34]; 
they conjectured their splay tree to have this property. Later, Lucas 23 and independently Munro 27 


proposed a greedy offline algorithm as another candidate for dynamic optimality. In 2009 Demaine, Harmon, 
lacono, Kane, and Patra§cu (DHIKP) presented a geometric view of the BST model, in which the Lucas- 
Munro offline algorithm naturally turns into an online one (simply called Greedy). Both splay and Greedy 
are considered plausible candidates for achieving optimality. However, despite decades of research, neither of 
the two algorithms are known to be o(log n)-competitive (the best known approximation ratio for any BST 
is O(loglogn) 12 , but the algorithms achieving this bound are less natural). Thus, the dynamic optimality 
conjecture remains wide open. 

Why is dynamic optimality difflcult to prove (or disprove)? For a vast majority of access sequences, 
the optimal access cost by any dynamic BST (online or offline) i^0(TOlogn) and any static balanced tree 
can achieve this boundj^ Hence, dynamic optimality is almost trivial on most sequences. However, when an 
input sequence has complexity o(m log n), a candidate BST must “learn” and “exploit” the specific structure 
of the sequence, in order to match the optimum. This observation motivates the popular research direction 
that aims at understanding “easy” sequences, i.e. those with OPT = 0{md) when d is a parameter that 
depends on the structure of the sequence; typically d = o(logn). Parameter d can be seen as a measure of 
“easiness”, and the objective is to prove that a candidate algorithm achieves the absolute bound of 0{md); 
note that this does not immediately imply that the algorithm matches the optimum. 

The splay tree is known to perform very well on many “easy” sequences: It achieves the access time of 
0{md) when d is a parameter that roughly captures the average distances between consecutive accesses mg. 
the average entropy, or the recentness of the previous accesses |34j ; these properties are called dynamic 
finger, static optimality and working set bounds respectively (the latter two bounds are also satisfied by 


Greedy 15 ). The notorious difflculty of dynamic optimality is perhaps witnessed by the fact that many 


other highly restricted special cases have resisted attempts and remain unresolved. These special cases stand 
as a roadblock to proving (or disproving) the optimality of any algorithm: proving that A is optimal requires 
proving that it performs well on the easy inputs; refuting the optimality of A requires a special “easy” 
subclass on which A is provably inefficient. In any case, a better understanding of easy sequences is needed. 


but currently lacking, as observed by several authors 11 29 


One famous example of such roadblocks is the traversal conjecture, proposed by Sleator and Tarjan in 
1985. This conjecture states that starting with an arbitrary tree T (called initial tree) and accessing its keys 
using the splay algorithm in the order given by the preorder sequence of another tree T' takes linear time. 
Since the optimum of such preorder sequences is linear, any dynamically optimal algorithm must match this 
bound. After three decades, only very special cases of this conjecture are known to hold, namely when T' 
is a monotone path (the so-called sequential access 36 ) or when T' = T fj Prior to this work, no online 


BST algorithm was known to be linear on such access sequences. 


Motivated by well-established concepts in combinatorics 20 21,37 , in this paper we initiate the study 


of the complexity of access sequences in terms of pattern avoidance parameters. Our main contributions are 


^This observation appears to be folklore. We include a proof in Appendix F 

^As customary, we only consider successful accesses, we ignore inserts and aeletes, and we assume that m > n. 
^This result implies an offline BST with 0(n) cost for accessing a preorder sequence, by first rotating T to T'. 
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Table 1: Easy sequences for BST and best known upper bounds. Results referenced as [*] are new. The symbol “?” 

means that no nontrivial bound is known. In the first rows, fi is the number of times that i is accessed, 
xt is the element accessed at time t, and rt is the number of distinct accesses since the last access of xt). 
In the ’’Sequential”, ’’Traversal”, and ’’Pattern-avoiding” rows it is assumed that the access sequence is a 
permutation, i.e. m = n. 


two-fold (the precise statement of the results is in § |1.1[ and the results are summarized in Table [^. 

(i) We study access sequences parametrized by pattern avoidance and analyze the access time of Greedy 
in terms of this parameter. As a by-product, we almost settle the traversal conjecture for Greedy 
in two orthogonal directions: (a) Greedy is “almost linear” for the traversal conjecture, and (b) if 
a linear-cost preprocessing is allowed. Greedy is in fact linear. This is perhaps the first evidence in 
support of Greedy as a candidate for dynamic optimality (all previously known properties of Greedy 
were subsumed by splay). These results are derived via an input-revealing property of Greedy, in the 
sense that the execution log of Greedy partially reveals the structure of the input sequence. 

(ii) We study a decomposability property of sequences and show that a wide subclass of the pattern¬ 
avoiding class satisfies this property. This allows us to (a) identify the core difficulty of designing an 
offline algorithm for dynamic optimality, (b) obtain a tight bound for the optimum of this input class, 
and (c) derive a simple proof that Cole’s showcase sequence is linear. 


1.1 Our results 


We first define the pattern avoidance parameters. Let [n] = {1,... ,n} be the set of keys. We represent a 
length-m sequence by an m-tuple X = {xi,... ,Xm) € [n]™. Throughout the paper, we assume that X is 
a permutation, i.e. m = n and Xi ^ Xj for all i ^ j. This is justified by Theorem 2.1 but we remark that 
many of our results can be extended to non-permutation access sequences. 

Let Sk denote the set of all permutations in {k\^■ Given A £ 5'„, we say that X contains a permutation 
pattern tt £ , if there are indices 1 < fi < 12 < • • • < n, such that {xi ^, ■ • •, Xi ^) is order-isomorphic to 

TT = (tti, ..., TTfc). Otherwise we say that X avoids tt, or that X is n-free. Two sequences of the same length 
are order-isomorphic if they have the same pairwise comparisons, e.g. (5,8, 2) is order-isomorphic to (2,3,1) 
(Figure [^. As examples, we mention that (2, l)-free and (2, 3, l)-free sequences are exactly the sequential, 
respectively, preorder access sequences. 

For each permutation A, a pattern avoidance characteristic of A, denoted by Sjf, is the set of all 
patterns not contained in A, i.e. Sx = {tt £ 5^ : ^ > 1 and A is 7r-free}. The pattern avoidance parameter 
of A, denoted by k{X), is the minimum fc £ N for which Sk H Sx 7 ^ 0- 
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For each k G N, the pattern avoidance class Ck contains all sequences X with pattern avoidance parameter 
k. This characterization is powerful and universal: Ci C for all i, and C„+i contains all sequences. 


Bounds for Greedy in terms of k. We study the access cost of Greedy in terms of the pattern 
avoidance par ameter. Our most general upper bound of ' holds for any access sequence in Ck 

(Theorem lljJ). Later, we show a stronger bound of for two broad subclasses of Ck that have intuitive 

geometric interpretation: fc-decomposable sequences (Theorem |1.4[ ), which generalize preorder sequences and 
^-increasing sequences (Theorem |1.6[ ), which generalize sequential access. 

Theorem 1.1. Let X G Ck be an access sequence of length n. The cost of accessing X using Greedy is at 
most '. 


The following corollary simply follows from the fact that all preorder sequences X belong to C 3 (plugging 
k = 3 into the theorem). 

Corollary 1.2 (“Almost” traversal conjecture for Greedy). The cost of accessing a preorder sequence of 
length n using Greedy, with arbitrary initial tree is at most '. 


Next, we show sharper bounds for Greedy when some structural property of Sx is assumed. First, we 
prove a bound when Ex contains all “non-decomposable” permutations of length at least A: -|- 1. We show 
that this property leads to a decomposition that has intuitive geometric meaning, which we describe below. 

Given a permutation a = (ui,..., cr^) G Si, we call a set [a,b] a block of a, if {tJai Ca+i,..., (Jb} = 
{c, c + 1,..., d} for some integers c,d G [i]. In words, a block corresponds to a contiguous interval that is 
mapped to a contiguous interval. We say that a permutation a G Si is decomposable if there is a block of a 
of size strictly betweeen 1 and otherwise, we say that it is non-decomposable (a.k.a. 

We say that an input sequence X is decomposable into k blocks if there exist disjoint [ai, 5i],..., [a^, bk] 
such that each [ai,bi] is a block for X and (UJ®*)^*]) FlN = [u]. The recursive decomposition complexity of 
a sequence X is at most d if one can recursively decompose X into subblocks until singletons are obtained, 
and each step creates at most d blocks (equivalently we say that X is d-recursively decomposable, or simply 
d-decomposable). See Figure It is easy to see that preorder sequences are 2-decomposablej^ 

The following lemma (proof in Lemma |C.1[ ) connects pattern avoidance and recursive decomposition 
complexity of a sequence. 

Lemma 1.3. Let X G Sn- Then Ex contains all simple permutations of length at least k -\-1 if and only if 
X is k-decomposable. 

This lemma implies in particular that if X is fc-decomposable, then X G Ck+i and Theorem o can 
be applied. The following theorem gives a sharper bound, when a preprocessing (independent of A) is 
performed. This yields another relaxed version of the traversal conjecture. 


Theorem 1.4. Let X be a k-decomposable access sequence. The cost of accessing X using Greedy, with a 
linear-cost preprocessing, is at most \ 


Corollary 1.5 (Traversal conjecture for Greedy with preprocessing). The cost of accessing a preorder 
sequence using Greedy, with preprocessing, is linear. 


Our preprocessing step is done in the geometric view (explained in §[^ in order to “hide” the initial tree 
T from Greedy. In the corresponding tree-view, our algorithm preprocesses the initial tree T, turning it 
into another tree T', independent of X, and then starts accessing the keys in X using the tree-view variant 
of Greedy (as defined in [^). We mention that DHIKP 
their algorithm (implicitly) performs this preprocessing. 


II define Greedy without initial tree, and thus 


®See for a survey of this well-studied concept. The decomposition described here appears in the literature as substitution- 
decomposition. 

®The entire set of 2-decomposable permutations is known as the set of separable permutations 1^. 
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As another natural special case, we look at Sx 2 {(fcj---)l)}) for some value k. In words, X has 
no decreasing subsequence of length k (alternatively, X can be partitioned into at most k — 1 increasing 
subsequence^. Observe that the k = 2 case corresponds to sequential access. We call this the k-increasing 
property of X. Similarly, if Ex O {(!,..., fc)}, we say that X is k-decreasing. We obtain the following 
generalization of the sequential access theorem (this result holds for arbitrary initial tree). 

Theorem 1.6. Let X be a k-increasing or k-decreasing sequence. The cost of accessing X using Greedy 
is at most . 


All the aforementioned results are obtained via the input-revealing property of Greedy (the ideas are 
sketched in § 1.21. 


Bound for signed Greedy in terms of k. We further show the applicability of the input-revealing 
technique. Signed Greedy (denoted SGreedy) is an algorithm similar to Greedy. It does not always 
produce a feasible BST solution, but its cost serves as a lower bound for the optimum. Let us denote by 
A{X) and OPT(vA) the cost of an algorithm A on X, respectively the optimum cost of X. Then we have 
A{X^) > OPT(X), and OPT(X) = II(SGreedy(X)). 

Conjecture 1 (Wilber [^, using [^). OPT(X) = 0(SGreedy(X)) for all input sequences X. 

We show that our techniques can be used to upper bound the cost of SGreedy. 

Theorem 1.7. The cost o/SG reedy on an access sequence X £ Ck is at most n2^^^\ 

Corollary 1.8. Let A be an algorithm. If Conjecture^^is true, then one of the following must hold: 

(i) The cost of accessing X using A is at most n2^^'^£ for all X £ Ck, for all k. In particular, the traversal 
conjecture holds for A. 

(ii) A is not dynamically optimal. 


Decomposability. Next, we prove a general theorem on the complexity of sequences with respect to 
the recursive decomposition. Let X £ Sn he an arbitrary access sequence. We represent the process of 
decomposing X into blocks until singletons are obtained by a tree T, where each node u in 7” is associated 
with a sequence Xy (see Figure [^. 

Theorem 1.9 (Decomposition theorem, informal). OPT(Ai) < GREEDY(7f^) -\-0{n). 

Corollary 1.10. Let X be a k-decomposable sequence of length n (with k possibly depending on n). Then 
X has optimum cost OPT(Ai) < 0(n log fc). 


This result is tight for all values of k, since for all k, there is a fc-decomposable sequence whose complexity 
is VlinXogk) (Proof in Appendix G.2|. This result gives a broad range of input sequences whose optimum 
is well below the performance of all known BST algorithms. The class can thus be useful as “candidate 
counterexample” for dynamic optimality. (Since we know OPT, we only need to show that no online al¬ 
gorithm can match it.) We remark that Greedy asymptotically matches this bound when k is constant 
(Theorem |1.4[ ). 

The following observation follows from the fact that the bound in Theorem 1.9 is achieved by our proposed 
offline algorithm, and the fact that OPT(X) > Lemma D.4). 


Corollary 1.11. //Greedy is constant competitive on non-decomposable sequences, then there is an offline 
0 (1)-approximate algorithm on any sequence X. 


This corollary shows that, in some sense, non-decomposable sequences serve as the “core difficulty” 
proving dynamic optimality, and understanding the behaviour of Greedy on them might be sufficient. 


of 


^This equivalence is proven in Lemma 


G.2 
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Finally, Cole et al. 10 developed sophisticated technical tools for proving the dynamic finger theorem for 
splay trees. They introduced a “showcase” sequence to illustrate the power of their tools, and showed that 
splay has linear cost on such a sequence. Theorem |1.9| immediately gives an upper bound on the optimum 
access cost of this sequence. 


Corollary 1.12 (informal). Let X he Cole’s showcase sequence. Then OPT(X) = 0{n). 
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Figure 1: From left to right: (i) plot of permutation a = (6, 1,3, 2, 8, 7, 4, 5) containing (2,1,3) (highlighted) and 
avoiding (4, 3,2,1), (ii) permutation cr, and (hi) its block decomposition tree; permntation (3,1,4, 2) at 
the root is obtained by contracting the four blocks into points, (iv) simple permutation (6, 1, 8 , 4, 2, 7, 3, 5). 


1.2 Overview of techniques 

We sketch the main technical ideas of our proofs. First, we define roughly the execution log matrix of 
Greedy. The log of executing Greedy on X G [n]"* is an n-by-m matrix Gx such that Gx{a,t) = 1 if and 
only if element a is touched by Greedy at time t. The cost of this execution is exactly the number of ones 
in Gx, denoted by w{Gx) (we make this claim precise in §[^. 

A submatrix of Gx is obtained by removing some rows and columns of Gx ■ We say that Gx contains a 
matrix B if there is a submatrix B' of Gx such that B{i,j) < for all i,j (in words, any ‘one’ entry 

in B must also be a one in B'). 

We say that an algorithm A is input-revealing if there is a constant-size matrix B such that any submatrix 
of an execution matrix of A containing B must contain an access point. When such a matrix B exists for an 
algorithm, we say that B is a. capture gadget. The existence for Greedy of a capture gadget B allows us to 
observe that Gx contains the patterrj^Q ® B only if the input sequence contains the pattern defined by Q. 
So, if we execute Greedy on an input sequence X £Ck that is Q-free for Q G Sk, then Gx cannot contain 
Q ® B. Now, we can use results from forbidden submatrix theory for bounding w{Gx)- In short, forbidden 
submatrix theory is a collection of theorems of the following type: 

Theorem 1.13. Let M and P be n-hy-n and k-by-k matrices respectively, such that k < n (mostly k should 
be thought of as constant). If M does not contain P, then w(M) < nfp(n,k). 

The function fp depends on the matrix P that is forbidden in M. For instance, if P contains at most 
one 1-entry per row and column, then fp(ri,k) is only a function of k. If P contains at most one 1-entry 
per column but possibly multiple Is per row, then fp{n,k) = The tightness of the bounds we 

obtain depends on the structure of the capture gadget B. This is the reason we have a weaker bound for 
any X G Ck and stronger bounds when X is more restricted. 

We also develop a different set of tools to study the intrinsic complexity of decomposable sequences. 
Here, in order to compute bounds on OPT, we decompose the execution matrix Mx of an algorithm in a 
way that mirrors the recursive decomposition of the input sequence X. Unfortunately, the behaviour of 
Greedy is too capricious to afford such a clean decomposition. We therefore modify Greedy to reduce the 
interference between blocks in the input sequence. The modified algorithm might perform more work locally 
than Greedy, but globally it is more “robust”. It is however, an offline algorithm; whether there exists an 
online algorithm competitive with our robust Greedy, is an interesting open question. 

denotes the tensor (Kronecker) product, to be defined later. 
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Related work. For an overview of results related to dynamic optimality, we refer to the survey of la- 
cono fT^. Besides Tango trees, other C)(loglogn)-competitive BST algorithms have been proposed, similarly 

all the known bounds can 


using Wilber’s first bound (multi-splay and chain-splay [^). Using 13 


be simultaneously achieved by combining several BST algorithms. Our study of the execution log of BST 
algorithms makes use of the geometric view of BST introduced by DHIKP 11 . 

Pattern avoidance in sequences has a large literature in combinatorics 20 37 , as well as in computer 
science [^|^[^[^[^. The theory of forbidden submatrices can be traced back to a 1951 problem of 
Zarankiewicz, and its systematic study has been initiated by Fiiredi 16 , and Bienstock and Gyori [^. Our 
use of this theory is inspired by work of Pettie 29 30 , who applied forbidden pattern techniques to prove 


bounds about data structures (in particular he proves that splay trees achieve 0 {na*{n)) cost on deque- 
sequences, and he reproves the sequential access theorem for splay trees). There are marked differences in 
our use of the techniques compared with the work of Pettie. In particular, we look for patterns directly in 
the execution log of a BST algorithm, without any additional encoding step. Furthermore, instead of using 
fixed forbidden patterns, we make use of patterns that depend on the input. 


Organization of the paper. In §[^we give a short description of the geometric view of the BST model 
(following DHIKP II ), and we introduce concepts used in the subsequent sections. In §[^we study the input- 


revealing property of Greedy, and prove our bounds for pattern-avoiding sequences. In §|^ we introduce 
a robust Greedy, state the decomposition theorem and its applications. Finally, §[^ is reserved for open 
questions. We defer the proofs of several results, as well as some auxilliary material to the Appendix. 


2 Preliminaries 

The main object of study in this paper are {0,1}-matrices (corresponding to the execution matrix of an 
algorithm). For geometric reasons, we write matrix entries bottom-to-top, i.e. the first row is the bottom¬ 
most. Thus, is the value in the i-th column (left-to-right) and the j-th row (bottom-to-top) of M. 

We simultaneously view M as a, set of integral points in the plane. For a point p = {x, y), we say that p S M, 
if M{x,y) = I. We denote by p.x and p.y the x- and y-coordinates of point p respectively. 

Given a point set (matrix) M, let p and q be points not necessarily in M. Let Dpq be the closed rectangle 
whose two corners are p and q. Rectangle is empty if M n Dpg \ {p, q} = 0. Rectangle Dpg is satisfied 
if p,q have the same x or y coordinate, or there is another point r G M Cl Dpg \ {p,q}. A point set M is 
(arboreally) satisfied if Dpg is satisfied for any two points p,q G M. 


Geometric setting for online BST. We describe the geometric model for analyzing online BST algo¬ 
rithms (or simply online BST). This model is essentially the same as II , except for the fact that our model 
allows cost analysis of BST algorithms that start accessing an input sequence from arbitrary initial tree. 

For any access sequence X G [n]™, we also refer to X as an n-by-m matrix, called access matrix^ where 
there is a point {x,t) S A" iff an element x is accessed at time t. We call {x,t) the access point (or input 
point) at time t. We also refer to the x-th column of X as element x, and to the t-th row of X as time t. 

For any tree T of depth d, let d{x) be the depth of element x in T. We also refer to T as an n-hy-d matrix, 
called initial tree matrix, where in each column x, there is a stack of points (x, 1),..., (x, d — d(x) + 1) G T. 
See Figure for an illustration. Observe that matrix T is satisfied. 

An online geometric BST A accessing sequence X starting with initial tree T works as follows. Given a 
matrix [^] as input, A outputs a satisfied matrix where At{X) A A is an n-by-m matrix called 

the touch matrix of A on A with initial tree T. More precisely, A proceeds in an online fashion: At time 
t = I,..., n, the algorithm A has access to T as well as to the first t rows of matrix A, and it must decide on 
the t-th row of the output At (A) which cannot be changed afterwards. Algorithm A is feasible if 
is satisfied. 

The matrix At (A) can be thought of as the execution log of A, and points in At (A) are called touch 
points; the cost of A is simply w(At(A)), i.e. the number of points (ones) in At{X). 
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An online algorithm with preprocessing is an online algorithm that is allowed to perform differently at 
time t — 0. The linear-cost preprocessing used in this paper, as well as in DHIKP 11 , is to put all elements 
into a split tree (see for the details of this data structure). The consequences of this preprocessing are 
that (i) in the geometric view the initial tree matrix can be removed, i.e. an algorithm A only gets X as 
input and outputs A{X), and (ii) in the tree-view, an input initial tree T is turned into another tree T' 
independent of the input A, and then keys in X are accessed using the tree-view variant of Greedy. 

DHIKP showecj^that online geometric BSTs are essentially equivalent to online BSTs (in tree-view). 
In particular, given any online geometric BST A, there is an online BST A! where the cost of running A! on 
any sequence X with any initial tree T is a constant factor away from w{At{X)). Therefore, in this paper, 
we focus on online geometric BSTs and refer to them from now on simply as online BSTs. 

Unless explicitly stated otherwise, we assume that the access sequence A is a permutation, i.e. each 
element is accessed only once, and thus A and At{X) are n-by-n matrices. This is justified by the following 
theorem (proof in Appendix . 

Theorem 2.1. Assume that there is a c-competitive geometric BST A^ for all permutations. Then there is 
a ic-competitive geometric BST A for all access sequences^^ 


Greedy and signed Greedy. Next, we describe the algorithms Greedy and signed Greedy (de¬ 
noted as S Greedy). 

Consider the execution of an online BST A on sequence A with initial tree T just before time t. Let 
Ot-i denote the output at time t — 1 (i.e. the initial tree T and the first t — 1 rows of At{X)). Let 
T(b,t— 1) = maxjz | (&, *) S Ot-i} for any b S [n]. In words, T(hA— 1) is the last time strictly before t when 
b is touched, otherwise it is a non-positive number dehned by the initial tree. Let (a, t) G A be the access 
point at time t. We dehne stairt(a) as the set of elements b G [n] for which □(a,t),(b,T(b.t-i)) is not satisfied. 
Greedy touches element x € [n] if and only if a: G stairt(a). See Figure [^for an illustration of stairs. 




o 

• • 
• • • • 
• • • • 

.JO 

.o • • 

o • • • • • 


Figure 2: Tree-view and geometric view of initial tree. Search path and stair (at time 1) of element 5. 


We define stairLeftt(a) = {6 G stairt(a) | 5 < a} and stairRight((a) = {6 G stairt(a) | 6 > a}. The 
algorithms GreedyLeft and GreedyRight are analogous to Greedy, but they only touch stairLeftt(a), 
respectively stairRight((a) for every time t; SGreedy simply refers to the union of the GreedyLeft and 
GreedyRight outputs, which is not necessarily satisfied. It is known 11 18 that SGreedy constant- 
approximates the independent rectangle bound which subsumes the two previously known lower bounds on 
OPT, called Wilber’s hrst and second bounds [Ml. 


Theorem 2.2 ( [H ^^). For any BST A (online or offline) that runs on sequence X with arbitrary 
initial tree T, w(^ 7 ’(A)) = r2(SGREEDY(A)). 


A nice property of Greedy and SGreedy is that an element x can become “hidden” in some range 
in the sense that if we do not access in x's hidden range, then x will not be touched. Next, we formally 
define this concept, and list some cases when elements become hidden during the execution of Greedy and 
SGreedy. The concept is illustrated in Figure]^ 

®with some minor additional argument about initial trees from Appendix IBT| 

^®The statement holds unconditionally for offline algorithms, and under a mild assumption if we require and A. to be 
online. 
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Figure 3: After time t, element x is hidden in (u>, n] for Greedy, and in (w,*] for GreedyRight. After time t', 
element x is hidden in {w,y) for Greedy; it is easy to verify that any access outside of \u),y\ will never 
touch X. 


Definition 2.3 (Hidden). For any algorithm A, an element x is hidden in range [w^y\ after t if, given that 
there is no access point p G [w,y] x {t,t'] for any t' > t, then x will not be touched by A in time {t,t']. 

Lemma 2.4 (Proof in Appendix |A.1[ ). Let x be some element. 

(i) If there is an element w < x where T(w,t) > T{x,t), then x is hidden in {w,n] and (w,a;] after t for 
Greedy and GreedyRight respectively. 

(ii) If there is an element y > x where T{y,t) > T{x,t), then x is hidden in [1,2/) and [x,y) after t for 
Greedy and GreedyLeft respectively. 

(Hi) If there are elements w,y where w < x < y, and T{w,t),T{y,t) > T(x,t), then x is hidden in {w,y) 
after t for Greedy. 


Forbidden submatrix theory. Let M be an n-hy-m matrix. Given I C [n] and J C [m], a submatrix 
is obtained by removing columns in [n] \ I and rows in [m] \ J from M. 

Definition 2.5 (Forbidden submatrix). Given a matrix M and another matrix P of size cx d, we say that 
M contains P if there is a cx d submatrix M' of M such that M'(i,j) = 1 whenever P(i,j) = 1. We say 
that M avoids (forbids) P if M does not contain P. 

We often call an avoided (forbidden) matrix a “pattern”. For any permutation tt = (tti, ... ,7 r/;) G S^, 
we also refer to tt as a k-hy-k matrix where (tt^, i) G tt for all 1 < f < fc. Permutation matrices have a single 
one-entry in every row and in every column. Relaxing this condition slightly, we call a matrix that has a 
single one in every column a light matrix. We make use of the following results. 


Lemma 2.6. Let M a nd P be n-by -n and k-by-k matrices such that M avoids P. 

(i) (Marcus, Tar dos 25) Fox 14f If P is a permutation, the number of ones in M is at most 

(ii) (Nivasch 28’, Pettie 31’) If P is light, the number of ones in M is at most 


We define the tensor product between matrices as follows: M G G is the matrix obtained by replacing 
each one-entry of M by a copy of G, and replacing each zero-entry of M by an all-zero matrix equal in size 
with G. Suppose that M contains P. A bounding box i? of P is a minimal submatrix M\ij such that P is 
contained in B, and / and J are sets of consecutive columns, resp. rows. We denote min(/), max(/), min(J), 
max(J), as B.xmin, B.xmax, B.ymin, B.ymax, respectively. 


3 Greedy is input-revealing 


In this section, we prove Theorems |l.l[ [L4l |1.6| and 1.7 We refer to § |l.l| for definitions of pattern-avoiding, 
^-decomposable and fc-increasing permutations. 

To bound the cost of algorithm A, we show that At{X) (or A{X) if preprocessing is allowed) avoids 
some pattern. In the following, Incfc, Dec^, and Alt^ are permutation matrices of size k, and Cap is a light 
matrix. These matrices will be defined later. 




















Lemma 3.1. Let X he an access sequence. 

(i) If X avoids a permutation P of size k, then for any initial tree T, GreedyRighT 7 ’(^), GreedyLefT 7 ’(X) 
and GREEDY'r(-^) avoid P G (1, 2), P G (2,1) and P G Cap respectively. 

(ii) If X is k-increasing (k-decreasing), then for any initial tree T, GreedY 7 ’(X) avoids {k,..., 
f(l,..., fc) G Incfc+i). 

(Hi) If X is k-decomposable, then Greedy(X) avoids P G Altk+i for all simple permutations P of size at 
least fc + 1. 


The application of Lemma 2.6 is sufficient to complete all the proofs, together with the observation that, 
for any permutation matrix P of size fc x A:, if G is a permutation (resp. light) matrix of size £ x £, then 
P G G is a permutation (resp. light) matrix of size k£ x k£. 

The rest of this section is devoted to the proof of Lemma [XT] We need the definition of the input-revealing 
gadget which is a pattern whose existence in the touch matrix (where access points are indistinguishable 
from touch points) gives some information about the locations of access points. 


Definition 3.2 (Input-revealing gadgets). Let X he an access matrix, A he an algorithm and A{X) he the 
touch matrix of A running on X. 

- Capture Gadget. A pattern P is a capture gadget for algorithm A if, for any hounding box B of P in 
A{X), there is an access point p £ B. 

- Monotone Gadget. A pattern P is a fc-increasing (fc-decreasing) gadget for algorithm A if, for any 
bounding box B of P in A{X), either (i) there is an access point p G B or, (ii) there are k access 
points pi,...,pk such that B.ymin < pi.y < •■■ < pk.y < B.ymax and pi.x < ■■■ < pk-X < B.xmin 
{pi.x > ■ ■ ■ > pk-x > B.xmax). In words, pi,... ,Pk form the pattern (1,..., fc) (resp. (fc,..., 1 )). 
Alternating Gadget. A pattern P is a fc-alternating gadget for algorithm A if, for any hounding box B 
of P in A{X), either (i) there is an access point p G B or, (ii) there are fc access points pi,... ,Pk such 
that B.ymin < pi.y < ■ • • < Pk-U < B.ymax and B.xmax < pi.x for all odd i and pi.x < B.xmin for all 
even i. 


Notice that if we had a permutation capture gadget G for algorithm A, we would be done: If X avoids 
pattern Q, then At{X) avoids Q<S)G and forbidden submatrix theory applies. With arbitrary initial tree T, it 
is impossible to construct a permutation capture gadget for Greedy even for 2-decomposable sequences (see 
Appendix |G.l[ ). Instead, we define a light capture gadget for Greedy for any sequence X and any initial tree 
T. Additionally, we construct permutation monotone and alternating gadgets. After a preprocessing step, 
we can use the alternating gadget to construct a permutation capture gadget for fc-decomposable sequences. 

Lemma 3.3. (i) (12) is a capture gadget for GREEOYTiiGiiT and {21) is a capture gadget for GkeebyIiEFT. 

(ii) Cap is a capture gadget for Greedy where Cap = (,*,). 

(Hi) Inck+i (D&Ck+i) is a k-increasing (k-decreasing) gadget for Greedy where Inck = (fc,..., 1) and 
Decfc = (l,2,...,fc). 

(iv) Deck+i (inck+i) is a capture gadget for Greedy that runs on k-increasing {k-decreasing) sequence. 

(v) Altk+i is a k-alternating gadget for Greedy where Altk = ([(fc -b 1)/2J , fc, 1, fc — 1, 2,...). 

(vi) Altk +4 is a capture gadget for Greedy that runs on k-decomposable sequence without initial tree. 


For example. Alts ~ ^ * j ' emphasize that only result (vi) requires a preprocessing. 

Given Lemma |3.3[ and the fact that fc-increasing, fc-decreasing, and fc-decomposable sequences avoid 


(fc,..., 1), (1,..., fc), resp. all simple permutations of size at least fc -I- 1, Lemma 3.1 follows immediately. 


Proof. We only show the proofs for (i) and (ii) in order to convey the flavor of the arguments. The other 


proofs are more involved. See Appendix B.2 for proofs of (iii),(iv), and Appendix C.2 for proofs of (v),(vi). 


(i) We only prove for GreedyRight. Let a, b be touch points in GreedyRightt(A') such that a.y < b.y 
and form (1,2) with a bounding box B. As a.y = T{a.x,a.y) > T{b.x,a.y), by Lemma [2.4| (i), b.x is hidden 
in {a.x, b.x] after a.y. Suppose that there is no access point in B = □(!;,, then b cannot be touched, which is 
a contradiction. 
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(ii) Let a,b,c be touch points in GreedY 7 ’(X) such that a.x < b.x < c.x and a,b,c form Cap with 
bounding box B. As T{a.x,a.y),T{c.x,a.y) > T{b.x,a.y), by Lemma 2.4 (hi), b.x is hidden in (a.x, c.x) after 
a.y. Suppose that there is no access point in B, then b cannot be touched, which is a contradiction. I 


4 Decomposition theorem and applications 


In this section, we take a deeper look at decomposable permutations and investigate whether Greedy is 
“decomposable”, in the sense that one can bound the cost of Greedy on input sequences by summing over 
local costs on blocks. Proving such a property for Greedy seems to be prohibitively difficult (we illustrate 
this in Appendix G.ll. Therefore, we propose a more “robust” but offline variant of Greedy which turns 
out to be decomposable. We briefly describe the new algorithm, and state the main theorems and three 
applications, leaving the details to Appendix [D| 


Recursive decomposition. We refer the reader to §o for the definitions of blocks and recursive de¬ 
composition. Let S' be a fc-decomposable permutation access sequence, and let T be its (not necessarily 
unique) decomposition tree (of arity at most k). Let P be a block in the decomposition tree T of S. 
Consider the subblocks Pi, P 2 ,..., Pk of P, and let P denote the unique permutation of size [/c] that is 
order-isomorphic to a sequence of arbitrary representative elements from the blocks Pi, P 2 ,..., Pk- We de¬ 
note P = P[Pi, P 2 ,..., Pk] a deflation of P, and we call P the skeleton of P. Visually, P is obtained from 
P by contracting each block Pi to a single point. We associate with each block P (i.e. each node in both 
the corresponding rectangular region (in fact, a square), and the corresponding skeleton P (Figure llj). We 
denote the set of leaves and non-leaves of the tree T by L{T) and N(T) respectively. 

Consider the k subblocks Pi,...,Pk of a block P in this decomposition, numbered by increasing y- 
coordinate. We partition the rectangular region of P into k'^ rectangular regions, whose boundaries are 
aligned with those of Pi,... ,Pk. We index these regions as R(i,j) with i going left to right and j going 
bottom to top. In particular, the region R(i,j) is aligned (same ^-coordinates) with Pj. 


A robust version of Greedy. The main difficulty in analyzing the cost of Greedy on decomposable ac¬ 
cess sequences is the “interference” between different blocks of the decomposition. Our RGreedy algorithm 
is an extension of Greedy. Locally it touches at least those points that Greedy would touch; additionally 
it may touch other points, depending on the recursive decomposition of the access sequence (this a priori 
knowledge of the decomposition tree is what makes RGreedy an offline algorithm). The purpose of touching 
extra points is to “seal off” the blocks, limiting their effect to the outside. 

We now define the concept of topwing. The topwing of a rectangular region i? at a certain time in the 
execution of RGreedy contains a subset of the points in R (at most one in each column). A point is in 
the topwing if it forms an empty rectangle with the top left or the top right corner of R (this definition 
includes the topmost touch point of the leftmost and rightmost columns of R, if such points exist in R). 
When accessing an element at time t, RGreedy touches the projections to the timeline t of the topwings 
of all regions R(i,j) aligned with a block Pj ending at time t (this includes the region of Pj itself). Observe 
that multiple nested blocks might end at the same time, in this case we process them in increasing order of 
their size. See Figure]^ for an illustration of RGreedy. 


Theorem 4.1 (Decomposition theorem. Theorem D.2). For any decomposition tree P of a permutation P, 
the total cost of RGreedy is bounded as 


RGreedy(P) < 4 • ^ Greedy(.P) -I- ^ Greedy(P) -|- 3n. 

PGY(r) peL{r) 


Application 1: Cole et al.’s “showcase” sequences. This sequence can be defined as a permutation 
P = P[Pi,..., P„/ iog„] where each Pj is the permutation (1, ..., logn), i.e. a sequential access. It is known 
that Greedy is linear on sequential access, so GREEDY(Pj) = O(logn). Now, by applying Theorem 


4.1 


we 


can say that the cost of RGreedy on P is RGreedy(P) < 4 • Greedy(P) -I- J2j GREEDY(Pj) -I- 0(n) < 
■ ^°S(loen)) + DFn= 0{n). This implies the linear optima of this class of sequences. 
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Figure 4: Illustration of RGreedy. Left: region with points and topwing highlighted. Right: a sample run of 
RGreedy on a 2-decomposable input. Access points are dark circles, points touched by Greedy are gray 
circles, points touched by the RGreedy augmentation are gray squares. The access point in the topmost 
line completes the black square and also the enclosing gray square. The topwing of the black square 
consists of the black circle at position (1,6) and the gray circle at (3,5). Therefore the augmentation of 
the black square adds the gray square at (3,6). The topwing of R{2, 2) consists of the gray circle at (4,4) 
and hence the gray square at (4,6) is added. The topwing of the gray square consists of points (1,6) and 
(6,3); the point (4,4) does not belong to the topwing because (4,6) has already been added. Therefore, 
the gray square at (6,6) is added. 


Application 2: Simple permutations as the “core” difficulty of BST inputs. We prove that, if 
Greedy is c-competitive on simple permutations, then RGreedy is 0(c)-approximate for all permutations. 

To prove this, we consider a block decomposition tree T in which the permutation corresponding to every 
node is simple. Using Theorem|4.1|and the hypothesis that Greedy is c-competitive for simple permutations: 


RGreedy(P) < 4c • OPT(P) -be- ^ OPT(P) -b 3n. 

PeAf(r) PeL{T) 


Now we decompose OPT into subproblems the same way as done for RGreedy. We prove in Lemma pl).4| 
that OPT(P) > ^pg.y-OPT(P) —2n. Combined with Theorem |4^ this yields RGreedy(P) < 4c-0PT(P)-b 
(8c-b 3)n. Using Theorem 2.1 we can extend this statement to arbitrary access sequences. 


Application 3: Recursively decomposable permutations. Let A be a /c-decomposable permutation. 
Then there is a decomposition tree T of A in which each node is a permutation of size at most k. Each block 
P S L{T) U A(T) has Greedy(P) = 0(|P|log|P|) = 0{\P\\ogk). By standard charging arguments, the 
sum J2peN(r)uL{r) “ C>{n), so the total cost of RGreedy is 0(n log fc); see the full proof in Appendix [Pj 
We remark that the logarithmic dependence on k is optimal (Proof in Appendix |G.2[ ), and that Greedy 
asymptotically matches this bound when k is constant (Theorem |1.4[). 
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5 Discussion and open questions 


Besides the long-standing open question of resolving dynamic optimality, our work raises several new ones 
in the context of pattern avoidance. We list some that are most interesting to us. 

Can the bound of Theorem o be improved? While our input-revealing techniques are unlikely to yield 
a linear bound (Appendix |G.1[ ) , a slight improvement could come from strengthening Lemma |2.6[ ii) for the 
special kind of light matrices that arise in our proof. 

An intriguing question is whether there is a natural characterization of all sequences with linear optimum. 
How big is the overlap between “easy inputs” and inputs with pattern avoidance properties? We have 
shown that avoidance of a small pattern makes sequences easy. The converse does not hold: There is a 
permutation X G Sn with k{X) = but for which Greedy(A) = 0{n); see Appendix G.4 Note that 


our pattern-avoiding properties are incomparable with the earlier parametrizations (e.g. dynamic finger); see 
Appendix |G.4[ Is there a parameter that subsumes both pattern-avoidance and dynamic finger? 

A question directly related to our work is to close the gap between OPT = 0(nlog fc) and '> by 

Greedy on fc-decomposable sequences (when k = a;(l)). Matching the optimum (if at all possible) likely 
requires novel techniques: splay is not even known to be linear on preorder sequences with preprocessing, 
and with forbidden-submatrix-arguments it seems impossible to obtain bounds beyoncj^ 0 {nk). 

We proved that if Greedy is optimal on simple permutations, then RGreedy is optimal on all access 
sequences. Can some property of simple permutation^^ be algorithmically exploited? Can RGreedy be 
made online? Can our application for Cole’s showcase sequence be extended in order to prove the dynamic 
finger property for Greedy? 

Finally, making further progress on the traversal conjecture will likely require novel ideas. We propose 
a simpler question that captures the barrier for three famous conjectures: traversal, deque, and split (Ap¬ 
pendix G.5). Given any initial tree T, access a preorder sequence defined by a path P (a BST where each 
non-leaf node has a single child). Prove a linear bound for your favorite online BST! 
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An n-by-n matrix avoiding all permutations of size at least k can cont ain Q,{nk) ones. 
Note that simple permutations can have linear cost. In Appendix * 


G.4 


we show such an example. 
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A Warmup: A direct proof for preorder sequences 

In this section we show that the cost of Greedy on an arbitrary preorder access sequence is linear. In other 
words, we give a direct proof of Corollary |I.5[ We assume the preprocessing discussed in §[^ or equivalently, 
in geometric view we assume that Greedy runs with no initial trej^ i.e. initially all stairs are empty. The 
proof is intended as a warmup, to develop some intuition about the geometric model and to familiarize with 
the concepts of wings and hidden elements used in the subsequent proofs. We remark that an alternative 
proof of the same result (up to the same constants) can be obtained using forbidden submatrix arguments. 

Preorder sequences. The class PreOrd„ of preorder sequences of length n can be defined recursively as 
follows. A permutation sequence X = (xi,... ,x„) belongs to PreOrd„ if (i)|A| = n = I or (ii) there is a 
number n' < n, such that 

• L{X) = (x 2 ,..., x„/) e PreOrdn'-i, 

• R{X) = {xn'+i, ■■.,Xn) & PreOrdn_n', 

• max{L(A)} < xi < min{i?(A)}. 

We remark again the well-known fact that preorder sequences are exactly the (2,3, l)-free permutations, 
and the easily verifiable fact that preorder sequences are 2-decomposable (see Figurefor an illustration). 



Figure 5: Preorder sequence with sets L and R, with wingj^{a) highlighted. Observe that point a together with L 
can form a single block. 

We prove the following theorem. 

Theorem A.l. Let X G PreOrd„. Then Greedy(A) < 4n. 

A.l Wings and hidden elements 

Let X G PreOrdp. For an arbitrary element a G X, let G denote its access time. Let La denote the maximal 
contiguous subsequence of X immediately after a, that consists of elements smaller than a. Let Va G X 
be the smallest element that is larger than a and precedes a in A. By convention, Tq is oo if there are no 
elements larger than a that precede a in A. Let Ra denote the maximal contiguous subsequence after La of 
elements larger than a and smaller than r^. 

Let T~a be the concatenation of La and Ra- Let lca(b, c) denote the last element a, such that &, c S Ta- 
We define wingL{a) to contain all elements b G La such that the rectangle with corners (6 ,tb), {a,ta) 
contains no other access point. Symmetrically, we define wingn^a) to contain all elements b G Ra such that 
the rectangle with corners (6,4), (a, to) contains no other access point (Figure [^. 

Lemma A. 2. The sets {wingL{a)}a^[n\ disjoint. Similarly, the sets {wingfi{a)}^^^^-^ are disjoint. 

^^Curiously, in the case of Greedy and preorder sequences this has the same effect as if the initial tree would be exactly the 
tree that generated the input sequence. 
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Proof. Assume for contradiction that there are two elements a, a' G [n] such that wingr^a) nwingL(a') ^ 0. 
Assume w.l.o.g. that a < a'. By definition, wingL{a) only contains elements in La, so there must be an 
element b G La Cl La', such that b < a < a' and tb > ta and 4 > t'a- There are two possibilities: either 
ta > t'a, contradicting the fact that the rectangle with corners {a', t'a), {b,tb) is empty, or 4 < t'a, in which 
case the subsequence {a,a',b) is order-isomorphic to (2,3,1), contradicting the fact that A is a preorder 
sequence. I 

The reader might find it instructive to see the corresponding situation in tree-view. Figure]^ shows a proof 
by picture in the tree corresponding to the preorder sequence X. 



Figure 6: BST corresponding to a preorder sequence. Left wings of elements indicated as shaded rectangles. 


Corollary A.3. -I- Iwingnia)]) < 2n. 

Next we restate and prove Lemma [2.4| about hidden elements. Recall that T{x,t — 1) denotes the last 
time strictly before t when x is touched. 


Definition A.4 (Definition 2.31 . For any algorithm A, an element x is hidden in range [w,y] after t if, 
given that there is no access point a G [tc, y] x {t, t'] for any t' > t, then x will not be touched by A in time 
{t,t']. 


Lemma A.5 (Lemma |2.4[ ). Let x he some element. 

(i) If there is an element w < x where T(w,f) > T{x,t), then x is hidden in {w,n] and (w,a;] after t for 
Greedy and GreedyRight respectively. 

(a) If there is an element y > x where T{y,t) > T{x,f), then x is hidden in [l,y) and [x,y) after t for 
Greedy and GreedyLeft respectively. 

(Hi) If there are elements w,y where w < x < y, and T{w,t),T{y,t) > T(x,t), then x is hidden in {w,y) 
after t for Greedy. 


Proof. (i) Consider any t' > t. For the case of Greedy, assume that there is no access point in {w, n] x 
{t,t']. Suppose X is touched in the time interval {t,t'], and let (p,tp) be the first access point in this 
time interval that causes the touching of x. Then p G [1, w], and tp G (t, t']. As x is not touched in the 
time interval (t, tp) by the choice of p, we have that t(x, tp — 1) = t{x, t). If t{w, t) > t(x, t), then the 
rectangle with corners (p, tp), and {x, t{x, tp — 1)) contains the point {w, t(w, t)), and thus it is satisfied 
before accessing p. Therefore, the accessing of p via Greedy does not touch x, a contradiction. It 
follows that X is hidden in {w, n] for Greedy after t. 

For the case of GreedyRight, assume that there is no access point in {w,x] x {t,t']. Suppose x is 
touched in the time interval {t,t'], and let {p,tp) be the first access point in this time interval that 
causes the touching of x. Then p G [l,w], and tp G {t,t']. (The case p G {x,n] is not possible for 
GreedyRight, since p < x must hold.) The remainder of the argument is the same as for Greedy. 
(ii) The argument is symmetric to (i). 
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(iii) Consider any t' > t. Assume that there is no access point in {w, y) x {t, t’]. Suppose x is touched in the 
time interval (t, t'], and let (p, tp) be the first access point in this time interval that causes the touching 
of X. There are two cases. If p G [1, w], we use the argument of (i). If p G [y,n], we use the argument 
of (ii). 


A.2 Bounding the cost of Greedy 

Lemma A. 6 . For any element c G La\wing]^{a), a is not touched when accessing c using Greedy. Similarly 
for c G i?a \ wingji{a). 


Proof. There must be an access point 6, such that c < b < a and ta < tb < tc, for otherwise c would be in 
wingi^{a). Since T(b,tb) > T(a,tb), it follows that a is hidden in {b,n\ after tb- All accesses in the interval 
{tb,tc] are in [c, 6). Hence, a is not touched. I 

Corollary A.7. Let a be any element. Greedy touches a when aecessing elements in Ta at most \wingL{a)\ + 
\wingii{a)\ times. 

Lemma A.8. Let a be any element. Greedy touches a when aceessing elements outside Ta at most once. 


Proof. No element is touched before it is accessed. Since the elements in Ta are accessed consecutively, we 
only need to consider accesses after all elements in Ta have been accessed. All such accesses are to elements 
to the right of Ta- Let c be the first element accessed after the last access to an element in Ta and let 
b = lca{a, c). Then a G Lb, and c is the first element in Rb. 

Since T contains all elements in (&, c]. Greedy does not touch any element in (6, c] before time tc. Let 
d be the element in (a, 5] with largest time T{d,t(, — 1). If there are several such d, let d be the rightmost 
such element. Since no element in (b,c] is accessed before time tc, d is also the rightmost element in (a, c] 
with largest time T(d,tc — 1). The rectangle with corners {d,T{d,tc — 1)) and (c,tc) contains no other point 
and hence d is touched at time tc- 

Clearly, T(a, G) ^ tc = T{d,tc)- Thus a is hidden in [I,c?) after time tc and hence a is not touched after 
time tc- I 


Corollary A.7 and Lemma A.8 together 
\wingR{a)\ +1 times, so the total cost is at 
the proof of Theorem |A.I[ 


imply that each element a is only touched at most \wingLia) \ + 
most + I'^^tng]i{a)\ + 2) < 4n. This concludes 


B Access sequences with initial trees 

B.l Initial trees. 

In the BST model, the initial tree is the state of the BST before the first element is accessed. In the tree-view 
of BST execution, we know the structure of the tree exactly in every time step, and hence the structure 
of the initial tree as well. This is less obvious, however, in the geometric view. In fact, according to the 
reduction of DHIKP, the initial tree is always a split tree [II] , hidden from us in geometric view. We show 
how to “force” an arbitrary initial tree into the geometric view of BST execution. 

First, we observe a simple fact. Given a binary search tree T, we denote the search path of an element a 
in T as path(a,T). It is easy to show that two BSTs T and T 2 , containing [n], have the same structure iff 
path(a,Ti) = path(a,T2) as a set, for all a G [n]. 

The stair of an element is, in some sense, the geometric view of the search path. Indeed, path(a, T) = {b \ b 
is touched if a is accessed in T}. Similarly, stairt(a) = {6 | 6 is touched if a is accessed at time t}. Thus, to 
model an initial tree T into the geometric view, it is enough to enforce that stairi(a) = path(a,T). 

We can achieve this in a straightforward way, by putting stacks of points in each column below the first 
row, corresponding to the structure of the initial tree T. More precisely, in column x we put a stack of height 
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d — d{x) + 1, where d{x) is the depth of element x in T, and d is the maximum depth of an element (Figure 
[^. One can easily show that stairi(a) = path(a,T) for all a G [n]. 

We remark that the points representing the initial tree are not counted in the cost. 


B.2 fc-increasing sequences 

We prove Lemma |3.3| (iii) and (iv). 

Lemma B.l (Lemma |3.3[ iii)). Jncfe+i is a k-increasing (k-decreasing) gadget for Gke^dy where 

Inck = (fc,..., 1) and Decj, = (1, 2,, k). 


Proof We only prove that Incfc+i is a /c-increasing gadget. Let qo,... ,qk be touch points in GreedY 7 ’(X) 
such that qi.y < qt+i-y for all i and form Inc^^+i with bounding box B. Note that, for all 1 < i < A:, we can 
assume that there is no touch point q[ where q[.x = qi.x and qt-i-y < q[.y < qi.y, otherwise we can choose 
g' instead of qt. So, for any t G [qi-i.y,qi.y) we have that T{qi-i.x,t) > T{qi.x,t) for all 1 < * < A:. Suppose 
that there is no access point in B. 

By Lemma |2.4| (ii), qi.x is hidden in [1, q^.x) after qp-y. So there must be an access point pi G [l,B.xmin)x 
{qQ.y,qi.y\, otherwise qi cannot be touched. We choose such pi where pi.x is maximized. If pi.y < qi.y, 
then T(pi.x,pi.y), T(qQ.x,pi.y) > T{qi.x,pi.y). So by Lemma 2.4 (iii), qi.x is hidden in (pi.x, q^.x) after pi.y 
and hence qi cannot be touched, which is a contradiction. So we have pi.y = qi.y and pi.x < B.xmin. 

Next, we prove by induction from i = 2 to A; that, given the access point pi-i where Pi-i.y = qi-i.y and 
Pi-i.x < B.xmin, there must be an access point pi where pi.y = qi.y and Pi-i.x < pi.x < B.xmin. Again, 
by Lemma [ 2 ( 4 ] (iii), there must be an access point pi G (pi-i.x, qi-i.x) x (qi-i.y, qi.y] otherwise qi cannot be 
touched. We choose the point Pi where Pi-x is maximized. If Pi.y < qi.y, then T{pi.x,Pi.y),T{qi_i.x,Pi.y) > 
T[qi.x,pi.y). So by Lemma 2.4 (iii), qi.x is hidden in (jpi.x, qi-i.x) after pi.y and hence qi cannot be touched. 
So we have pi.y = qi.y and Pi-i.x < pi.x < B.xmin. Therefore, we get pi.x < • ■ ■ < pk.x < B.xmin which 
concludes the proof. I 


Lemma B.2 (Lemma |3.3[ iv)). Deck+i (inck+i) is a capture gadget for Gbsedy that runs on k-increasing 
{k-decreasing) sequence. 


Proof We only prove that Incfc+i is a capture gadget for Greedy running on A;-decreasing sequence X. 
Since Incfc+i is an increasing gadget, if Inc^+i appears in GreedY 7 ’(W) with bounding box B, then either 
there is an access point in B or there are access points forming (1,2,..., A:). The latter case cannot happen 
because X avoids (1, 2,..., k). So Inc^+i is a capture gadget. I 


C Access sequences without initial trees: /c-decomposable permu¬ 
tations 

C.l Pattern avoidance and recursive decomposability 


We restate and prove Lemma |l.3| that connects the A:-decomposability of a permutation with pattern avoid¬ 
ance properties. 


Lemma C.l (Extended form of Lemma 1.3). Let P he a permutation, and let k be an integer. The following 
statements are equivalent: 

(i) P is k-decomposable, 

(a) P avoids all simple permutations of size at least A: -|- I, 

(iii) P avoids all simple permutations of size k 1 and k 2. 
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Proof, [ii) 
(iii) = 


(Hi) is obvious. 


(ii) follows from the result of Schmerl and Trotter 33 that every simple permutation of length 
n contains a simple permutation of length n — 1 or n — 2, and the simple observation that if P avoids Q, 
then P also avoids all permutations containing Q. 

{a) {i) follows from the observation that a permutation P contains all permutations in its block 

decomposition tree Tp. Further, it is known that every permutation P has a block decomposition tree 
Tp in which all nodes are simple permutations. If P contains no simple permutation of size /c + 1 or more, 
it must have a block decomposition tree in which all nodes are simple permutations of size at most k, it is 
therefore, fc-decomposable. 

(i) => {ii): we show the contrapositive ~^{ii) => ^(*). Indeed, if P contains a simple permutation 
Q of size at least k + 1, then any /c-decomposition of P would induce a fc-decomposition of Q, contradicting 
the fact that Q is simple. I 


C.2 ^-decomposable permutations 

We prove Lemma [3?3| (v) and (vi). We refer to §[^for the necessary definitions. We show an example of the 
gadget Altfei 

Alt;7 = 


Lemma c.2 (Lemma |3.3[ v)). Alt^+i is a k-alternating gadget for Gkeeby where Altk = (L(fc+ 1)/2J ,k,l,k- 

1 , 2 ,...). 

Proof Let Qo, - ■ ■ ,qk be touch points in Greedy(X) such that Qi.y < qi+i-y for all i and form Altfc_|_i with 
bounding box B. Suppose that there is no access point in B. 

We prove for i = 1,..., fc, that there exists an access point pi S {B.xmax, n] x {qi-i.y, qi.y] for odd i, and 
Pi G [1, B.xmin) x (qi-i.y, qi.y] for even i. 

For odd i, by Lemma 2.4 (i), qi.x is hidden in (qi-i.x,n] after qi-i.y. So there must be an access point 
Pi G {B.xmax,n] x {qi-i.y,qi.y], otherwise qi cannot be touched. 

For even i, by Lemma 2.4 (ii), qi.x is hidden in [l,qi_i.x) after qi_i.y. So there must be an access point 
Pi G [1, B.xmin) x {qi-i.y,qi.y], otherwise qt cannot be touched. 

Observe that pi,... ,pk satisfy B.ymin < pi.y < • ■ ■ < pu.y < B.ymax and B.xmax < pi.x for all odd i 


and Pi.x < B.xmin for all even i, fulfilling the condition of Definition 3.2 Hence, Altfc+i is a fc-alternating 
gadget. 


Lemma C.3 (Lemma |3.3[ vi)). Altk+A is a capture /or Greedy that runs on k-decomposable sequence 

without initial tree. 

Proof. Let X be a ^-decomposable permutation. Since Alt/j +4 is a (fc -|- 3)-alternating gadget, if Altfc +4 
appears in Greedy(X) with bounding box B, then either there is an access point in B or there are access 
points Pi,P 2 , ■ ■ ■ ,Pk+ 3 , satisfying the conditions B.ymin < pi.y < ■ ■ • < Pk+s-y < B.ymax and B.xmax < 
Pi.x for odd i and pi.x < B.xmin for even i. Suppose there is no access point in B. 

Let qo,qi, ..., qk +3 denote the points that form Altfe +4 (where qo.y < qi.y < • ■ ■ < qk+ 3 .y), and let us 
denote V = {pi,... ,pk+ 3 }. Let T be a block decomposition tree of X of arity at most k. We look at each 
block P gT as a minimal rectangular region in the plane that contains all access points in the block. 

Let P* be the smallest block in T that contains two distinct access points Pi,Pj G V such that i is odd, 
and j is even. (Observe that pi is to the right of B, and Pj is to the left of B.) 

Observe first that the bounding box of P* must contain or intersect both vertical sides of B, otherwise it 
could not contain points on both sides of B. Furthermore, the bottom horizontal side of B must be contained 
entirely in the bounding box of P*: If that were not the case, then there would be no accesses below B, 
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and (Jo (the lowest point of Altfc_|_ 4 ) would not be touched by Greedy (since there is no initial tree). The 
following is the key observation of the proof. 

Lemma C.4. Let k' be the largest integer such that P* contains qo,. ■ ■ ,qk'- Then, k' < k + 1. In words, 
P* contains at most the k + 1 lowest points of Altk+i- 

Proof. Let P* = P{Pi,... ,Pk) be the decomposition of P* into at most k blocks. We claim that each block 
Pj can contain at most one point from V. First, by the minimality of P*, each block Pj never contains 
two access points from V from different sides of B; and Pj cannot contain two points from V from the 
same side of B either, because assuming otherwise that Pj contains two points from V from the left of 
B, it would follow that there is another access point (a, t) € 7^ on the right of B outside of Pj such that 
{Pj).ymin < t < (Pj).ymax, contradicting that Pj is a block. 

Observe further that, except for the block containing pi, each block Pj cannot overlap with B. Since we 
have at most k such blocks in the decomposition, and because qi.y > pi.y for all i, it follows that the top 
boundary of P* must be below qk+i- 


Since the bounding box of P* contains (at best) the first fc+1 points of Altfc+ 4 , the three remaining points 
of Altfc _|_4 need to be touched after the last access in P*. In the following we show that this is impossible. 

Let {L,tL) be the topmost access point inside P* to the left of B, such that there is a touch point inside 
B at time Let {R,tii) be the topmost access point inside P* to the right of B, such that there is a touch 
point inside B at time tjj. Let L' be the rightmost element inside B touched at time tj,, and let R' be the 
leftmost element inside B touched at time of tji. 

Lemma C.5. After time tL, within the interval [B.xmin, L'], only L' can be touched. Similarly, after time 
tfi, within the interval [R', B.xmax], only R' can be touched. 


Proof. Let L” be the rightmost touched element in [L, B.xmin) at time tj^, and let R” be the leftmost 
touched element in [B.xmax, R\ at time tn. 

We have that t[L" ,tif),T[L',tL) > T[x,t]f), for any x G [B.xmin, L'). Thus, any x G [B.xmin, L') is 
hidden in [L",L') after t^. 

Above P* there can be no access to an element in [V,L'\, since that would contradict the fact that P* 
is a block. Within P* after t^ there can be no access to an element in [L”, B.xmin), as the first such access 
would necessarily cause the touching of a point inside B, a contradiction to the choice of {L,tL). An access 
within B is ruled out by our initial assumption. 

Thus, there is no access in (L", L') x [t^, n], and from Lemma [2)4| (iii) it follows that any x G [B.xmin, L') 
can not be touched after time t^. 

We argue similarly on the right side oi B. I 


As a corollary of Lemma C.5 note that above P* only elements within [L', R'] can be touched within B. 
If L' = R', then only one element can be touched, and we are done. Therefore, we assume that L' is strictly 
to the left of R'. 

We assume w.l.o.g. that tn > t^ (the other case is symmetric). Let [Q,tQ) be the first access point left 
of B with tq > ta such that there is a touch point inside B at time tq. Observe that [Q,tq) is outside of 
P* by our choice of [L,tL). If no such Q exists, we are done, since we contradict the fact that Altfc +4 is a 
[k + 3)-alternating gadget. 

Let [Z,tz) denote the last touch point with the property that Z G [[P*).xmin, L’), and tz G [tL,tq). 
If there are more such points in the same row, pick Z to be the leftmost. Note that [Z,tz) might coincide 
with [L,tL). We have tz < tn because otherwise [Q,tq) cannot exist. Note than tz > ta would imply that 
elements in [L', R'] are hidden in [Z, R) after t^. 

Next, we make some easy observations. 


Lemma C.6. The following ranges are empty: 
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(i) [B.xmin, B.xmax\ x [tfi + l,iQ — 1], 

(a) [1,Z - 1] X [tz + 

(Hi) [Q, Z - 1] X [tz + 1, tq -1], 

(iv) [Q,R' - 1] X [tR + l,tQ- 1], 

Proof. (i) It is clear that there can be no access point inside B by assumption. Suppose there is a touch 
point in [B.xmin, B.xmax] x [Ir + l,tQ — 1], and let (Q' ,tQi) be the first (lowest) such point. Denote 
the access point at time tgi as {Q'',tQi). Clearly tgi > {P*).ymax must hold, otherwise the choice 
of {L,tL) or {R,tR) would be contradicted. Also Q” > {P*).xmax must hold, otherwise the choice of 
{Q,tQ) would be contradicted. 

But this is impossible, since Q' G [L',R'], and t{Q', tQi — 1) < tfl, and thus the rectangle with corners 
[Q", Iq’) and {Q', t{Q', Iq' — 1)) contains the touch point {R, tR), contradicting the claim that Greedy 
touches Q' at time tQ>. 

(ii) All elements in [1,Z — 1] are hidden in [1, Z — 1] after tz- There can be no access points on the left of 
P*, due to the structure of the block decomposition, and there is no access point in [{P*).xmin, Z —1] x 
[tz + 1, tg — 1] due to the choice of Z. Hence there cannot be a touch points in [1, Z — 1] x + 1, ^r]- 

(hi) First there is no access point in [Q,Z — 1] x [tz + 1, {P*).ymax] due to the choice of {Z,tz) and the 
structure of block decomposition. Also, there is no access point in [Q, Z — 1] x {{P*).ymax,tQ — 1]: 
Assume there were such an access point (Q^^Q')■ Then it must be the case that Q < Q' < {P*).xmin 
and {P*).ymax < tqi < tq. Any rectangle formed by {Q,tq) and a point in P* n i? would have 
contained {Q',tqi), a contradiction to the fact that Greedy touches a point inside B at time tq. 

Next, we argue that there is no non-access touch point (a, t) G [Q, Z — 1] x [tz -I-1, tg — 1]. There are 
three cases. 

- {a,t) G [{P*).xmin, Z — 1] x [tz + l,tg — 1] contradicts the choice of {Z,tz). 

- (a, t) G [Q, {P*).xmin) x [tz + 1, [P*).ymax\ contradicts the fact that all elements in [1, Z) are hidden 
in [1, Z) after tz, and there is no access in [1, Z) in the time interval [tz + 1, {P*).ymax], since P* 
is a block. 

- (a,t) € [Q, {P*).xmin) x {{P*).ymax,tq — 1], contradicts the claim that at time tq Greedy touches 
a point inside B, since any rectangle formed by {Q, tq) and a point in P* H i? would have contained 
{a,t). 

(iv) Given the previous claims, it remains only to prove that there is no touch point (a, t) G [Z, B.xmin) xt G 
\tR + \,tq — V\. There cannot be such a touch point for t < {P*).ymax due to the choice of {Z,tz). 
For t > {P*).ymax, there cannot be souch an access point due to the structure of block decomposition. 
Remains the case when {a,t) is a non-access touch point in [Z, B.xmin) x {{P*).ymax,tq — 1. This is 
also impossible, as any rectangle formed by {Q, tq) and a point in P* H P would have contained (a, t), 
contradicting the choice of {Q,tq). 


Lemma C.7. At time tq we touch R'. Let L* be the leftmost element touched in [L',R'] at time tq (it is 
possible that L* = R'). After time tq, only L* and R' can be touched within [L',R']. 

Proof. The first part follows from the emptiness of rectangles in Lemma |C.6| That is, the lemma implies 
that the rectangle formed by {Q,tq) and [R',tR) is empty, so Greedy touches R' at time tq. 

The elements a G {L*,R') cannot be touched because they are hidden in {L*,R') after tq, and there 
is no access point in this range after tq, due to the structure of the block decomposition. Suppose that 
some element a G [L',L*) is touched (for the first time) at some time t > tq hy accessing element x. So 
the rectangle formed by {x,t) and {a,T{a,t — 1)) is empty, and we know that T{a,t — 1) < tR. Notice that 
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Figure 7: Illustration of the proof of Lemma 


3.3 


X < {P*).xmin for otherwise would have been in the rectangle formed by {x,t) and {a,T{a,t— 1)), a 

contradiction. Furthermore, t ( o , t— 1) > tz for otherwise, (Z, tz) would have been in the rectangle formed by 
{x,t) and (o,r(a,t — 1)), a contradiction. Now, since T(a,t— 1) > tz, Lemma C .6 implies that the rectangle 
formed by (Q, tq) and (a, r(a, t — 1)) must be empty, therefore a is touched at time tg; this contradicts the 
choice of L*. 


Lemma [C.4[ C.5 and |C.7| together imply that in total at best the fc + 3 lowest points of Altfc +4 can be 
touched (out of the k + A needed). This means that our assumption that B is free of access points was false. 
Therefore Altfc _|_4 is a capture gadget, finishing the proof of Lemma 3.3 Vi). See Figure]^ for an illustration. 


D Decomposition theorem and applications 

D.l Preliminaries 

Given a permutation tr : [n] —>■ [u], we call a set [a, b] x [c, d], with the property that {o'(a), a{a+l ),..., a{b)} = 
[c,d], a block of a induced by the interval [a,b]. Let P be a block induced by interval I. Consider a 
partitioning of I into k intervals, denoted (left-to-right) as Ii, I 2 , ■ ■ ■, Ik, such that each interval induces a 
block, denoted respectively as Pi, P 2 ,..., Pk- Consider the unique permutation P of size [k] that is order- 
isomorphic to a sequence of arbitrary representative elements from the blocks Pi, P 2 , ■ ■ ■, Pk- We denote 
P = P[Pi,P 2 , • ■., Pk] a deflation of P, and we call P the skeleton of P. Visually, P is obtained from P 
by contracting each block Pi to a single point (Figure Deflation is analogously defined on permutations. 
Every permutation a has the trivial deflations a = cr[l,..., 1] and a = l[cr]. Permutations that have only 
the trivial deflations are called simple permutations. 

A block decomposition tree T of cr is a tree in which each node is a block of a. We define T recursively 
as follows: The root of T is the block [n] x [n]. A node P of T is either a leaf, or it has children Pi, P 2 ,... 
corresponding to a nontrivial deflation P = P[Pi, P 2 ,... ]. To every non-leaf block P we associate its skeleton 
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P. For convenience, we assume that the blocks Pi,... ,Pk are ordered by their y-coordinates from bottom 
to top. We partition P into rectangular regions whose boundaries are aligned with Pi,...,Pk, and 
index them R{i,j) with i going left to right and j going bottom to top. In particular, the region R(i,j) 
is aligned with Pj (same y-coordinates). To every leaf block P = [a,b] x [c, d] we associate the unique 
permutation that is order-isomorphic to (cr(a), a(a -|- 1),..., crib)). When there is no chance of confusion, we 
refer interchangeably to a block and its associated permutation. We denote the leaves and non-leaves of the 
tree T by L{T) and N{T) respectively. 

A permutation is k-decomposable if it has a block decomposition tree T in which every node has an 
associated permutation of size at most k. As an example, we observe that preorder sequences of binary 
search trees are 2-decomposable. 

D.2 A robust Greedy 

We augment Greedy with some additional steps and we obtain an offline algorithm (called RGreedy) with 
several desirable properties. We remark that in this section we always assume the preprocessing discussed 
in §[^ or equivalently, in geometric view we assume that RGreedy is executed without initial tree (since 
RGreedy is offline, this is a non-issue). 

Before describing the algorithm, we give some simple definitions. 

We denote the top left, respectively top right corner point of a rectangular region R as topleft{R), 
respectively topright{R). We define topwing{R) to be a subset of the points touched in R, that are the last 
touch points of their respective columns. A point q is in topwingiR) if at least one of the following properties 
is satisfied: 

- The rectangle with corners q and topleft{R) is empty, 

- The rectangle with corners q and toprightiR) is empty, 

- y is in the leftmost or rightmost column of R. 

Note that the set topwingiR) depends on the time point where we are in the execution of the algorithm, but 
for simplicity of notation, we omit the reference to a specific time (it will always be clear from the context). 

We can now describe how RGreedy processes an input point (a,t). First, it acts as Greedy would, i.e. 
for every empty rectangle with corners (a,<) and ib,t'), it touches the point (5, t) on the current timeline. 
Let Yt be the points touched in this way (including the point (a,t)). Second, it touches additional points on 
the timeline. Let vi, V 2 = parentivi), ..., vi = parentivi-i) be the nodes of the decomposition tree such 
that the blocks associated with these nodes end at time t. Note that vi is a leaf. Let = parentivi); 
exists except when the last input is processed. For h G [2, £-|-1] in increasing order, we consider the deflation 
P[Pi,, Pk] associated with v^. Let (a, t) lie in Pj; note that j = k if h < i and j < k for h = £ + 1. For 
every sibling region Rii,j) of Pj, RGreedy touches the projection of topwingiRii, j)) on the timeline t. 
Let Yhg be the points touched. Figure [^illustrates the definition of RGreedy. 

We need to argue that the augmentation step does not violate the feasibility of the solution. 

Lemma D.l. The output o/RG reedy is feasible. 

Proof. Suppose for contradiction that at time t we have created an unsatisfied rectangle r with corners (a, t) 
and ix,y), where y < t. The point (a, t) is the result of projecting (a,t') onto the horizontal line t, where 
ia,t') G topwingiR), for some region R processed at time t. Clearly t' < t. Assume that before adding (a,t), 
the rectangle formed by the points (a, T) and topleftiR) is empty. A symmetric argument works if (a, t') 
formed an empty rectangle with toprightiR). We have y > t', for otherwise the point (a, t') would satisfy r. 
Let R = Rii,j). Since ix,y) lies in the time window of R, we have (x, y) G i?(i',y) for some i'. 

Suppose first that i = i'. If x < a, then the rectangle with corners (a, t') and topleftiR) contains (x,y), 
contradicting the fact that (a, f) was in topwingiR) before the augmentation. Therefore x > a holds. But 
then, if r is not satisfied, then the rectangle with corners (x, y) and topleftiR) had to be empty before the 
augmentation, and therefore (x, t) must have been touched in the augmentation step, making r satisfied. 

The case i ft %' remains. If %' < i, then r contains the rectangle formed by (x, y) and the top right corner 
of Rft',j). Either this rectangle is nonempty, or (x,y) was in topwingiRii', j)) before the augmentation, in 
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which case (x, t) was touched in the augmentation step. In both cases r must be satisfied. The case when 
i' > i is symmetric, finishing the proof. 


Observe that RGreedy processes the input and emits the output line-by-line, in increasing order of time. 
It is, however, not an online algorithm, as it has a priori access to a block decomposition tree of the input. 


D.3 Decomposition theorem 

Our main technical contribution in this section is the following result: 

Theorem D.2 (Decomposition theorem for permutations). For any block decomposition tree F of a permu¬ 
tation P, the total cost of RGreedy is bounded as 

RGreedy(P) < 4 • ^ Greedy(P) -I- ^ Greedy(P) -|- 3n. 

PGJV(r) PeL{T) 

We discuss applications of Theorem |D.2| 


Application 1: Cole et al.’s “showcase” sequences. In Cole et al. 10 , a showcase sequence is defined 
in order to introduce necessary technical tools for proving the dynamic finger theorem. This sequence can 
be defined (in our context) as a permutation P = P[Pi, ■ • ■, Pn/iogn] where each Pj is a sequence for 
which Greedy has linear linear access cost 
applying Theorem 


D.2 


i.e. \Pj\ = logn, and GREEDY(Pj) = O(logn). Now, by 
we can say that the cost of RGreedy on P is RGreedy(P) < 4 • Greedy(P) -|- 
Greedy j^O(logn)-\-0{n) =0{n). This implies that our algorithm 
has linear access cost on this sequence. 


Application 2: Simple permutations as the “core” difficulty of BST inputs. We argue that 
whether RGreedy performs well on all sequences depends only on whether Greedy does well on simple 
permutation sequences. More formally, we prove the following theorem. 


Theorem D.3. If Greedy is c-competitive on simple permutations, then RGreedy is an 0(1) approxi¬ 
mation algorithm for OPT(A) for all permutations X. 


Using Theorem |2.1| we can extend this statement from all permutations to all access sequences. 

To prove Theorem D.3 we use of the known easy fact that every permutation P has a block de¬ 
composition tree T in which all permutations in P(T) and all permutations in N{P) are simple. Applying 
Theorem |D.2| and the hypothesis that Greedy is c-competitive for simple permutations, we get the bound: 


RGreedy(P) < 4c • OPT(P) -he- ^ OPT(P) -H 3n. 

PGAf(r) peL{r) 


Now we need to decompose OPT into subproblems the same way as done for RGreedy. The following 
lemma finishes the proof (modulo the proof of Theorem D.2), since it implies that RGreedy(P) < 4c • 
OPT(P) -h (8c-P3)n. 


Lemma D.4. OPT(P) > OPT(P) — 2n. 


Proof. We show that for an arbitrary permutation P = P[Pi,..., Pk], it holds that OPT(P) > OPT(P) -|- 
Sfci OPT(Pi) — k. The result then follows simply by iterating this operation for every internal node of the 
block decomposition tree T of P and observing that the numbers of children of each internal node add up 
to at most 2n. 
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Let Y* be the set of points of the optimal solution for P. We partition Y* into sets Y* contained 
inside each block Pi, and we let Y*^^ = Y* \ Y*. Each Y* must clearly be feasible for Pi, so we have 

|y.*| > OPT(Pi). It remains to show that \Y*^^\ > OPT{P) — k. 

Again, consider the partitioning of P into rectangular regions R{i,j), aligned with the blocks Pi,... ,Pk- 
Consider the contraction operation that replaces each region R{i,j) with a point (z, j) if and only if y*ni?(z, j) 
is nonempty. We obtain a point set Y' S [k] x [fc], and clearly |E'| < \Y*^^ \ + |.P|. We claim that the obtained 
point set Y' is a satisfied superset of P, and therefore \Y*^^\ > OPT(.P) — k. 

Indeed, suppose for contradiction that there is an unsatisfied rectangle with corner points p,q € Y'. 
Suppose p is to the upper-left of q (the other cases are symmetric). Let Rp and Rq be the regions before 
the contraction that were mapped to p, respectively q. Since Y* n Rp is nonempty, we can consider p' to be 
the lowest point in Y* H Rp and if there are several such points, we take p' to be the rightmost. Similarly, 
since Y* n Rq is nonempty, we can take q' to be the topmost point in Y* n Rq and if there are several 
such points, we take q' to be the leftmost. Now, if the rectangle with corners p and q is unsatished in Y', 
then the rectangle with corners p' and q' is unsatished in Y*, contradicting the claim that Y* is a feasible 
solution. I 


Application 3: Recursively decomposable permutations. We generalize preorder access sequences 
to ^-decomposable permutations, and show that RGreedy has linear access cost for all such permutations. 
Let fc be a constant such that there is a tree decomposition T in which each node is a permutation of size 
at most k. Notice that a preorder sequence is 2-decomposable. 


Theorem D.5. The cost o/RGreedy on any k-decomposable access sequence is at most 0{nlogk). 


Proof. Applying Theorem D.2 
0{n). First, notice that the 
so we have J2peL{r) “ 


we obtain that RGreedy(P) < Greedy(P)- 1-X)pgL(r) Greedy(P)-|- 

elements that belong to the leaf blocks are induced by disjoint intervals, 
i. For each such leaf block, since Greedy is known to have logarithmic 


amortized cost, we have Greedy(P) < 0(|P|log|P|) = 0(|P| logic). Summing over all leaves, we have 
SpGL(r) Greedy(P) < 0(nlogic). 

Next, we bound the cost of non-leaf blocks. Since the numbers of children of non-leaf blocks add up to at 
most 2n, we have J2pgn(T) ^gEiin, using the fact that Greedy has amortized logarithmic cost, 

it follows that J2p£N{r) Greedy(P) < J^p^n^t) G(I^I logic) < 0(nlogic). This completes the proof. I 


We mention that the logarithmic dependence on k is essentially optimal, since there are simple per¬ 
mutations of size n with optimum cost r2(nlogn). In fact, for arbitrary k, the bound is tight, as shown 
in Appendix |G.2| In Theorem |1.4| we proved that for arbitrary constant k, the linear upper bound for 
Ic-decomposable permutations is asymptotically matched by Greedy. 


D.4 Proof of Theorem ID.21 

We are now ready to prove the decomposition theorem. Let Y be the output of RGreedy on a permutation 
of P of size n, and let T be the block decomposition tree of P. We wish to bound |y|. 

For each leaf block P G P(T), let Yp be the points in P n Y. For each non-leaf block P G N{T), with 
children Pi,..., Pfc, let Yp be the set of points in Y n (P \ IJ^- Pj). In words, these are the points in block P 
that are not in any of its children (we do not want to overcount the cost). It is clear that we have already 
accounted for all points in Y, i.e. Y = UpgT suffices to bound X^pgr l^^l- 

For each non-leaf block P G N{T), we show later that |Yp| < 4:-GKEEBY{P)+5degree{P)+junk{P), where 
degree{P) is the number of children of P, and the term junk{P) will be defined such that ^pg 7 v( 7 -) junk{P) < 

2n. For each leaf block P, notice that apart from the RGreedy augmentation in the last row, the cost is 
exactly Greedy(P). Including the extra points touched by RGreedy we have that |Yp| = Greedy(P) -|- 
\topwing{P)\. Since leaf blocks are induced by a partitioning of [n] into disjoint intervals, we have that 
'l2peL{T) \topwing{P)\ < n. So in total, the cost can be written as GREEDY(P)-l-X)peL(r) Greedy(P)-1- 

5n as desired. It remains to bound J2p^n{T) I^pI- 
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Consider any non-leaf block P. Define the rectangular regions R{i,j) as before. We split |Yp| into three 
separate parts. Let Yp be the subset of Yp consisting of points that are in the first row of some region R{i,j). 
Let Yp be the subset of Yp consisting of points that are in the last row of some region R{i,j). Observe that 
any points possibly touched in an RGreedy augmentation step fall into Yp. Let Y? = Yp — {Yp U Yj) be 
the set of touch points that are not in the first or the last row of any region R{i,j). 

Denote by left{R) and right{R) the leftmost, respectively rightmost column of a region R. Similarly, 
denote by top{R) and bottom{R) the topmost, respectively bottommost y-coordinate of a region R. 

Lemma D.6. If region R{i,j) has no block Pq below it, RGreedy touches no point in R{i,j). If R{i,j) 
has a block Pq below it, RGreedy touches only points in columns left{R{i, j)) and right{R{i, j)). 

Proof. If R{i,j) has no block Pq below it, then it is never touched. Assume therefore that R{i, j) is above (and 
aligned with) block Pq for some q < j. When RGreedy accessed the last element in Pq, it touched the left and 
right upper corners of Pq in the augmentation step (since the topmost element in the leftmost and rightmost 
columns belong to topwing (Pq)). As there are no more accesses after top{Pq) inside [left{Pq) + \, right{Pq) — l\, 
all elements in this interval are hidden in \left{Pq) -b 1, righfiPq) — 1], and therefore not touched again. I 


Let TOpj be the indicator variable of whether the region R{i,j) is touched by RGreedy. Consider the 
execution of Greedy on permutation P and define rij j to be the indicator variable of whether the element 
{i,j) is touched by Greedy on input P. Notice that = Greedy(P). We refer to the execution 

of Greedy on permutation P as the “execution on the contracted instance”, to avoid confusion with the 
execution of RGreedy on block P. 


Lemma D.7. Before the augmentation of the last row of P by topwing{P) the following holds: A rectangular 
area R{i,j) in block P is touched by RGreedy iff Greedy touches {i,j) in the contracted instance P; in 
other words, mij = Uij for all i,j. 


Lemma 


D.6 


and Lemma 


D.7 


together immediately imply the bounds |YJ| < 2 • Greedy(P), and |Y|| < 


2 • Greedy(P) + \ topwing{P)\, since in the first or last row of a region R{i,j) that is not one of the Pfis at 
most two points are touched and no point is touched if Greedy does not touch the region. In the case of 
Yp, RGreedy also touches the elements in topwing{P) after the last access to a point in P. 


Proof. We prove by induction on j that after Pj is processed, the rectangular area R{i,j) is touched by 
RGreedy if and only if {i,j) is touched by Greedy in the contracted instance. 

We consider the processing of Pj and argue that this invariant is maintained. When j = 1, the induction 
hypothesis clearly holds. There are no other touch points below the block P (due to the property of the 
decomposition), so when points in Pi are accessed, no points in P \ Pi are touched. Similarly, in the 
contracted instance when the first point in P is accessed by Greedy, no other points are touched. 

First we prove that Greedy touching {i,j) in the contracted instance implies that RGreedy touches 
R{i,j). Suppose Greedy touches point {i,j). Let pj be the access point in the contracted instance at time j. 
There has to be an access or touch point {i,j') where f < j such that the rectangle with corners {i,j'), and 
p' had no other points before adding {i,j) (the point {i,f) was on the stair of pj at time j). The induction 
hypothesis guarantees that R{i,j') is touched by RGreedy, so some point q is touched or accessed in this 
region. Choose q to be the topmost such point, and if there are more points in the same row, choose q to be 
as close to block Pj as possible. 

We claim that q is in the stair of the first access point p* of Pj, therefore touching R{i,j): If q was 
not in the stair, there would be another point q' in the rectangle formed by p* and q. Observe that q' 
cannot be in R{i,j'), because it would contradict the choice of q being closest to p*. So f is in some region 
R{i",j") that lies inside the minimal rectangle embracing R{i,j'), and the region immediately below Pj. By 
induction hypothesis, the fact that R{i",j") contains point q' implies that {i",j") is touched by Greedy in 
the contracted instance, contradicting the claim that the rectangle with corners ii,j'), Pj is empty. 

Now we prove the other direction. Suppose RGreedy touches region R{i,j). This also implies that 
Greedy (without augmentation step) already touches R{i,j). But then Greedy must touch a point in 
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R{i,j) when accessing the first element p* in Pj. Assume w.l.o.g. that R{i,j) is to the right of Pj. It 
means that there is a point q in the stair of p*, such that q is in some region R{i,j') for j' < j. If in the 
contracted instance {i,j') is in the stair of the access point pj at time j, we would be done (because Greedy 
would add point so asssume otherwise that this is not the case. Let be the point that blocks 

so the corresponding region R(i",j") lies inside the minimal rectangle embracing both Pj and R{i,j'). 
Also note that the region R{i",j") cannot be vertically aligned with Pj because P is a permutation. Thus 
i < %". The fact that {i”^j”) is touched by Greedy implies, by induction hypothesis, the existence of a 
point q' G R{i”,j”), and therefore the augmentation step of RGreedy touches some point q* on the top 
row of region R(i",j"). This point would prevent q from being on the stair of p*, a contradiction. I 

Lemma D.8. \topwing{P)\ < 2 • degree{P). 

Proof. topwing{P) contains at most 2 points in block R(i, k) for every k. I 

The only task left is to bound |Tj|, the number of touch points in non-first and non-last rows of a region. 
This will be the junk term mentioned earlier, that sums to 2n over all blocks P G N{T). In particular we 
show that every access point can contribute at most 2 to the entire sum J2p£N{T) \^p\- 

Consider any access point X = (x, fa,), and let P* denote the smallest block that contains A, and in which 
X is not the first access point (such a block exists for all but the very first access point of the input). Further 
observe, that every point touched at time G inside P* is accounted for already in the sum J2p^n(T) l^pl> 
since these points are in the first row of some region inside P*. It remains to show that at time G at most 
two points are touched outside of P*, excluding a possible RGreedy augmentation step (which we already 
accounted for). Let p* denote the first access point in P*, i.e. at time bottom{P*). At the time of accessing 
p* several points might be touched. Notice that all these points must be outside of P*, since no element 
is touched before being accessed. Let pi^p and pright be the nearest of these touch points to p* on its left, 
respectively on its right. 

We claim that for all the access points in P* other than the first access point p* , only piep and Pright can 
be touched outside of P*. Since, by definition A is a non-first element of P*, it follows that at time G at 
most two points are touched outside of P* , exactly what we wish to prove. 

Indeed, elements in the interval [l,p;e/t ~ 1]: and in the interval [pright + 1)R] are hidden in [l,pieft — 1], 
respectively in [pright + 1)^], in the time interval [bottom{P*) + I, top{P*)] and therefore cannot be touched 
when accessing non-first elements in P*. The case remains when p/e/t or Pright or both are nonexistent. In 
this case it is easy to see that if on the corresponding side an element outside of P* were to be touched at 
the time of a non-first access in P* , then it would have had to be touched at the time of p*, a contradiction. 

This concludes the proof. 


E 


Proof of Theorem 


2.1 


In this section, we prove the following theorem. 


Theorem E.l (Theorem 2.1 1 . Assume that there is a c-competitive online geometric BST for all per¬ 
mutations. Then there is a Ac-competitive online geometric BST A for all access sequences. 


Given a c-competitive online BST A^ that can serve a permutation access sequence, we construct a 
4c-competitive online BST A that can serve any (not necessarily permutation) access sequences. We first 
sketch the outline of the proof. 

Let A G [n]'" be a (possibly non-permutation) access sequence, and let T be the initial tree on [n]. First 
we show that given any online BST B that can handle accesses, there is a canonical way to augment B and 
obtain an augmented that can also handle insertions of elements. 

Then, we show that given any access-only sequence A G [n]"*, initial tree T and an online augmented 
BST that can serve permutation access and insert sequences, we can construct a permutation sequence 

(comprising of both accesses and insertions) A^’™^ G [to]"* to be fed to in an online manner. Then 
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there are two steps. First, we show that the execution of can define the execution of A that runs on 

X with initial tree T. Second, we show that the cost of A^’™^ can be bounded by O(0PT(X)). Now we 
formally describe the proof. 

Handling Insertion Given an online BST B, let be an algorithm that works as follows. To access 
element x, just simulate B. To insert element x, try searching for x and miss, i.e. touching the search path 
of predecessor and successor of x in the tree. Then add x as a leaf. Then simulate B as if B is accessing the 
leaf X. 


E.l Constructing the algorithm A 

In the following, we work on K x Z instead of the integer grid Z x Z in order to simplify the algorithm and 
the proof. We extend some notations and definitions accordingly. We use the words “y-coordinate” and 
“row” interchangeably. For any X,T C M x Z, by [^], we mean the point set obtained by “putting X above 
T” while the x-coordinate of each element does not change. A point set A C M x Z is a permutation if there 
is at most one point in X for any fixed x-coordinate and any fixed y-coordinate. A satisfied set is defined in 
the usual no-empty-rectangle way. An online BST B, given [^ ] C K x Z, outputs a satisfied in an 

online manner. 

To describe the algorithm A for Theorem |2.1[ we define two operations: split and merge. Given any 
access sequence X, we define split(X) as follows. Let e be an extremely small positive constant. For each 
point {x,y) G X, we have (x -I- eg,y) G split(X) except if (x,y) is the first access point of element x (i.e. 
there is no (x',y) G X where x' < x), in which case (x,?/) G split{X). The intention is to “tilt” the non-first 
accesses of all elements in the simplest way. Note that split{X) is a permutation. We define the reverse 
operation merge{S) = {([xj ,y) \ (x,y) G S'} for any set S. For any (non-permutation) sequence A G Z x Z, 
let AP’™®, A^ = split{X) as a point set. All points in A^ are access points. However, in AP’®"''*, only points 
with integral x-coordinate are access points, the remaining points are insertion points. That is, the first 
points of each element of A “become” access points in AP’®”®, the remaining points “become” insertion 
points. 

Given [ ^ ] G Z x Z as an input to A, we define: 


At{X) = mer(?e(A?,’™*(AP>®®®*)) 

Lemma E.2. If A^ is online, then A is an online process. 

Proof. Observe that split and merge fix y-coordinate. If A^ is online, then merge(Ay™^(AP’®®®®)) can be 
computed row by row. I 


Lemma E.3. For any (non-permutation) access sequence X, and initial tree T it holds that At (A) A A 
and is satisfied 

Proof. To show that At(A) A A, observe that A = merge{split{X)) as e is very small. Next, AP’®®®®* C 
AP,*ns(AP,*"s) because AP’®®®* is an online BST. Finally, as merge is monotone under inclusion, we have 

A = merge{split{X)) = merge{X^'™^) C men?e(A5'’“^(AP’®®®®)) = At(A). 


To show that 
is satisfied, merge{ 


At{X)- 
T - 


is satisfied, we claim that merge preserves satisfiability. Therefore, as 




•{X^ 

T 


A^' 


S(^XP 

T 


’) 


is satisfied. 


) = [AAxy 

It remains to prove the claim. Let 5" C M x Z be any satisfied set and S' = merge{S). Consider any 
points p', q' G S' where p' .x < q' .x and choose the rightmost (leftmost) associated point p {q) of p' {q') from 
S. As S is satisfied, there is a point r G S \ {p,q} where r G Dpq. In particular, p.x < r.x < q.x. Let 
r' = ([r.xj ,r.y) G S' be the image of r under merge, then we have r' G Dp'q' because [p.xj < [r.xj < \q.x\ 
and merge does not affect y-coordinate. Moreover, r' ^ p',q', otherwise p is not the rightmost associated 
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Figure 8: Illustration of S' in the proof of Lemma 
points. 


E.5 


The symbols x denote accesses and circles are touched 


point of p', or similarly for q'. So we have identified r' G S' \ {p',q'} where r' G Dp',' and therefore S' is 
satisfied. I 


By Lemma E.2 and Lemma E.3 A is an online process where AriX) 2 X and AriX) is satished. Therefore, 
A is an online (geometric) BST for accessing arbitrary (non-permutation) sequences. 


E.2 Competitiveness 

The idea is to bound which is the cost of the online BST which does both access and insert 

by the cost of the online BST running on X that only supports access. 

Let TP be a binary search tree with m elements corresponding to the distinct ^-coordinates of points in 
XP. The elements with integral values in TP form a BST with exactly same structure as T. Between any 
two integral value elements v and v -\- there will be a right path (each node has only right child) of size 
equal to the number of points from XP with x-coordinate from {v,v -\- 1). This tree is defined so that the 
following claim holds. 

Lemma E.4. =A^t^{XP). 

Proof. We assume that Ap does not touch elements in the initial tree that have never been in any of the 
search paths for access or insert. By induction on time step t, if the point {x,t) G XP’™'^ is an access point, 
then AP'™" and Ap do the same thing by definition. If the point {x,t) G XP’™" is an insertion point, then 
after AP’™'" adds a: as a leaf, the structure of the tree excluding elements which are never touched is same. 
So AP’™" and Ap do the same thing again. I 

Lemma E.5. For any access sequence (no insertions) X, and the correspondinq access sequence split(X), 
OPT{split{X)) < 40PT(X). 

Proof. We show that given OPT(X), we can construct a satisfied set S' of split (X) where |S"| < 20PT(X) -|- 
2m. 

We iteratively modify OPT(X) to obtain S'. Consider any column Ci of OPT(X) which contains more 
than one access point, say access points. Suppose that Ci contains pi > mi touch points (thus pi — mi 
non-access touch points), and lies between Ci-i and c^+i. We “split” Ci into mi columns Ci^, Ci^,... Ci.^, lying 
between Ci_i and c^+i. Columns and Ci^_ have pi points in the same rows as c^; in the lowest of them 
is an access point, and all others above are touched, whereas in the highest is an access point, and all 
others below are touched. The remaining columns Ci^, - ■ ■ Xim -i have two points each in them, one access 
and one touch point. The access point in column Ci. is in the same row as the jth access in original column 
Ci, and the touch point in column Ci- is in the same row as the {j -\- l)th access in Ci. See Figure 

It is easy to verify that, after placing these points, the set is still satished. The number of points placed 
in these columns is exactly 2pi 2(mi — 2). Thus after applying this modihcation to every column Ci, there 
are at most 20PT(X) -|- 2m < 40PT(X) points. I 
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Now we are ready to prove the main theorem. 


2.1 


First, we have that At{X) < because merge never increases the 


Proof of Theorem 

number of points. We conclude that 

At{X) < = A^t,^{XP) < cOPJ{split{X)) < 4cOPT(X), 

where the second inequality follows from Lemma |E.4[ the third inequality follows from the assumption that 


A^ is c-competitive for only accessing on only permutations, and the last inequality follows from Lemma E.5 
and hence is a 4c-competitive online BST for any access sequence. □ 


F Random permutation is hard 


We prove that the cost of most permutations is n(nlogn) for all BST algorithm. A similar result has been 
shown by Wilber 39 for random access sequences (that might not be permutations). More formally, we 


prove the following theorem. 


Theorem F.l. There is a positive constant c such that for any L > 0 only a fraction 1/2^ of the permutations 
X have OPT(A) < (logn! — L — n — 1)/c. 


Proof. We use Kolmogorov complexity. Observe that the shape of a tree on k nodes can be encoded by 0{k) 
bits, e.g., by the preorder traversal. If the keys stored in the tree are clear from the context, the assignment 
of the keys to the nodes is unique. Let T be the initial tree, let X be any (permutation) access sequence, 
and let Cx be the total cost of OPT on X. We encode the shape of the initial tree with 0{n) bits. Consider 
now an access of cost k. It replaces an initial segment of the current search tree, the before-tree, by another 
tree on the same set of nodes, the after-tree. The before-tree must include the element accessed and has size 
0{k). We encode the before-tree and the after-tree with 0{k) bits and use a special marker to designate 
the accessed element in the before-tree. In total, we need c ■ {n + Cx) bits for the encoding for some small 
constant c. Note that X can be reconstructed from the encoding. We first reconstruct the shape of the initial 
tree. Its labelling by keys is unique. Consider now any access. Since the position of the accessed element 
in the before-tree is marked and since the before- and the after-tree are on the same set of keys, the search 
tree after the access is determined. We conclude that we have an injective mapping from permutations X 
to bitstrings of length c • (n -|- Cx)- Assume Cx < (logn! — L — n — 1)/c. Then the encoding of X has at 
most logn! — L — 1 bits and hence A is in a set of at most nl/2^ permutations. I 


G Other claims 

G.l Counterexamples for straightforward attempts 

Avoiding the same pattern is not enough. 

It might be tempting to conjecture that if X avoids P, then the Greedy touch matrix Greedy(A) avoids 
P as well (P refers both to a permutation pattern and its matrix representation). While this holds for the 
simple case P = (2,3,1) in the “no initial tree”-case (proof left as an exercise for the reader), in general it 
is not true. Figureshows a counterexample (P = (3, 2,1)). 

Greedy is not decomposable. 

One could prove a decomposition theorem similar to Theorem EH for Greedy, if Greedy were decompos¬ 
able. By this we mean that for a fc-decomposable input sequence, considering a certain level P = P[Pi,..., Pk] 
of the decomposition tree, Greedy touches a region R{i,j), iff the corresponding point {i,j) is touched by 
Greedy, when executed on the contracted instance P. Alas, this property does not hold for Greedy, as 
shown on Figure [Tol 


31 






o • o 
o • o 

• o o 

o • 
o • 

• 


Figure 9: Access points are black circles, points touched by Greedy are gray circles, 
output contains (3,2,1). 


Input avoids (3,2,1), but 
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Figure 10: Access points are black circles, points touched by Greedy are gray circles. Left: block decomposition 
of input, and Greedy execution. Top left region is touched. Right: Greedy execution on contracted 
instance. Top left point not touched. 


No permutation input-revealing gadget with initial trees. 

A permutation input-revealing gadget for Greedy would allow us to strengthen the bound in Theorem o 
from quasilinear to linear. In Figure we sketch a construction that shows that such a gadget can not 
exist, if initial tree is allowed, even if the input is the preorder sequence of a path. 

G.2 Tightness of 0{n\ogk) for ^-decomposable permutations 

In this section, we show that the upper bound of OPT(Ar) = 0(n log fc) for fc-decomposable permutations is 
tight, by arguing that there is a sequence that matches this bound. We will use the following claim, which 
is proved in § |D.4[ and restated here. 

Lemma G.l. Let X be any input sequence and T be its tree decomposition. Then 

OPT(X) > ^ OPT(P) - 2n 

P&T 

Let P' be a sequence of size k where OPT(P') = Q,{k\ogk). Then we construct the tree T which is a 
complete degree-fc tree of depth d, with each node v G T having the template Py = P'. Then n = k'^, and 
each non-leaf node v G N{T) has 0PT(P„) = Ll{klogk). There are k‘^~^ = n/k non-leaf nodes, so we have 
0PT(A:) > VL{k\ogk) ■ f = fl(nlogfc). 

G.3 Decomposition of ^-increasing sequences 

Given a permutation X = {Xi,... ,Xn), a subsequence of X of size A: is a sequence (A^i^,... ,Xi^), where 
1 < ii < ■ ■ ■ < ik < n. 

Lemma G.2. Let X G Sn be an arbitrary permutation. The following statements are equivalent: 

(i) X is {k,..., l)-free, 


32 
































(a) There exist pairwise disjoint subsequences Yi,Y 2 ,... ,Yk-i of X, such that Yi is increasing for all 
i = 1,... ,k — I, and |Yi| + • • • + l^fc-il = n. 

Proof (ii) ^ (i): 

Let Yi,..., Yfc_i be increasing subsequences of X, as in the statement of the lemma, and suppose there exists 
a subsequence X' = {Xi^,..., Xi^) of X order-isomorphic to the pattern {k,..., 1). Since X' is decreasing, 
no two elements of X' can be in the same subsequence Yi. This is a contradiction, since we have only k — 1 
subsequences. Therefore, such an X' cannot exist, and X is {k,, l)-free. 

(i) =» (ii): 

Assume that X is (fc,..., l)-free. We construct the decomposition of X into increasing subsequences 
Yi,..., Yk-i as follows. We look at the plot of permutation X, and we refer interchangeably to elements of 
X and points in the plot of X. Let Yi be the “wing”, i.e. the points in the plot of X that form an empty 
rectangle with the top left corner (by this dehnition, we have Xi G Yi). Clearly, Yi forms an increasing 
subsequence of X. We now remove the elements of Yi from X and similarly find Y 2 as the wing of the 
remaining points. We repeat the process, thereby finding Yi, Y 2 ,.... We claim that we run out of points 
before reaching Y^, thus constructing a decomposition of X as required by the lemma. 

Suppose otherwise that we have reached Y^, and consider an arbitrary point Xi^, G Y^. Since was 
not chosen in Y^-i, the rectangle with corners Xi^ and the top-left corner has some point from Yk-i. Pick 
such a point, and denote it as Observe that Xi^._^ > Xi^, and ik-i < ik- Since Xi^_^ was not chosen 

in Yfc_ 2 , we can pick a suitable Xi^_.^ in Yfc_ 2 . Continuing in this fashion, we construct the subsequence 
Xi^ ,..., Xii_ of X that forms the forbidden pattern (fc,..., 1), a contradiction. I 

G.4 Comparing easy sequences 

Easy sequences not captured by any known bounds. 

We show a sequence X where UB{X) = O(nlogn), k{X) = yXi, and OPT(A) = 0(n). By UB we denote 
the “unified bound” that subsumes both the working set, dynamic finger and entropy bounds. At this 
time, however, UB is not known to be matched by any online BST. UB can be seen as the sum of logs of 
“Li distances”. In other words, X is an easy sequence that is not captured by the known classes of easy 
sequences studied in the literature. 

Consider a point set V that is the perturbation of the ^/n-hy-^/n grid: Let i = ^/n, and 

V = {{ii+{j - l),j£ + i- 1) : i,j G [£]}. 

Denote the point {i£ + {j — l),j£ -I- (* — 1)) by p{i, j). It is easy to check that there are no two points aligning 
on a; or y coordinates: If p(i,j) agrees with p{i',j') on one coordinate, it implies that i = i' and j = j'. 
Therefore, this point set corresponds to a permutation X G S'„. Figure [l^i) illustrates the construction. 
The following lemma shows that k{X) = i/n. 

Lemma G.3. X contains all patterns a of size at most 1. 

Proof. Pattern cr corresponds to point set {{at, t)}^_^, this is order-isomorphic top(cri, l),p{a 2 , 2),... ,p{ak, k). 


Proposition G.4. UB{X) =fl(nlogn). 

Finally, we show that OPT(A) = 0{n) by describing an offline algorithm that achieves this bound. Our 
algorithm is similar to Greedy, where we augment some extra steps. First, partition [n] into 
such that Ij = {{j — 1)£ + l,j£}. Notice that the access sequence starts by touching elements p(1,1) € 
I\,p{2, 1) e J 2 , ... ,p{£, I) G h and then back to p{2, 1) G Ii and so on. This occurs for £ rounds. 

The algorithm proceeds just like Greedy, except that, when accessing p{i, j), after touching the minimal 
elements, it also touches the two boundaries of the interval U, i.e. {i — 1)£ -I-1 and i£. 

Proposition G.5. The algorithm produces a feasible solution. 
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Proposition G.6. The cost of this algorithm is 0{n). 

A slightly more involved inductive argument shows that Greedy too has linear cost on X. 
Proposition G.7. Greedy(A) = 0(n). 


Pattern avoidance and dynamic finger. 

We show examples that k(X) and the dynamic finger bound (denoted by DF{X)) are not comparable. 

hi) illustrates a sequence X where DF{X) = 0{n) and k{X) = TL{^Jnl \ogn). First, the sequence 


12 


Figure 

X starts with the perturbed grid construction of size f{n)-hy-f{n) where f{n) = \/n/ log n; then the sequence 
continues with a sequential access of length n — f{n)'^. Notice that, in the hrst phase of the sequence, the 
distances between consecutive points are |Ai+i — Xi\ = 0(/(n)), except for at most f(n) pairs, which 
have distance 0(/(n)^). In the second phase, the distances are one. So DF{X) = 0(/(n)^) log/(n) + 
0{n) = 0{n). On the other hand, the f{n)-hy-f{n) grid contains all permutations of size f{n), so we have 
k{X) = ^{^/n/ \ogn). 

Figure [T^ih) illustrates a 2-recursively decomposable sequence (thus having k{X) = 3) whose dynamic 
finger bound is O(nlogn); to see this, notice that |Ai_|_i — Xi\ > nj^ for i = l,...,n/2, so we have 
DF{X) > f log(n/2). 


Easy non-decomposable sequence. 

Figure [T^iv) shows the non-decomposable input sequence X = (n/2, n, 1, n — 1, 2,...) on which Greedy 
achieves linear cost (to see this, observe that Greedy touches at most three elements in each row). 


G.5 Why is path the roadblock to deque, traversal, split 


The deque-, traversal- and split conjectures are originally stated for splay trees. We first state the conjectures 
for any online BST A. 


Conjecture 2 (Deque conjecture [^). Starting with any initial tree T with n elements, the cost of A for 
inserting or deleting the current minimum or maximum elements m times is 0{m + n) 


Conjecture 3 (Traversal conjecture 34 ). Starting with any initial tree T with n elements, the cost of 


accessing a sequence X defined by the preorder sequence of some BST T' is 0{n). 


Conjecture 4 (Split conjecture [24| ). Starting with any initial tree T with n elements, the cost of A for 
splitting T by any sequence of n splits is 0(n). A split at element x is to delete x and to obtain two trees 
whose elements are smaller resp. larger than x, each subject to further splittings. 


Next, we state an easier conjecture which is implied by all three conjectures. 

Conjecture 5 (Path conjecture). Starting with any initial tree T with n elements, the cost of A for accessing 
a sequence X defined by the preorder sequence of some BST T' which is a path is 0(n). 

To express it in terms of pattern avoidance, the access sequences in the path conjecture avoid both (2, 3,1) 
and (2,1, 3), while the ones in traversal conjecture only avoid (2, 3,1). 

Intuitively, the sequence in path conjecture is a sequence that starts from minimum or maximum and 
moves inwards towards the middle, for example, (1, 2, 3,4,10,9, 5, 6, 8, 7). 

Theorem G.8. Let A be an online BST. 


(i) If A satisfies traversal conjecture, then A satisfies path conjecture. 

(ii) If A satisfies deque conjecture, then there is an online BST A' that satisfies path conjecture. 
(Hi) If A satisfies split conjecture, then there is an online BST A' that satisfies path conjecture. 
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Proof, (i) This is clear from the statement. 

(ii) If A satisfies deque conjecture, then A can delete the current minimum or maximum element n times 
with cost 0{n). Observe that the sequence of these deletions can be defined by a BST which is a path. 
Given A, we construct A' that satisfies path conjecture. To access x = 1 or n, simulates A by deleting 
X, but then before finishing, A! brings x back by making x the root. 

To access other elements x from the “minimum” (“maximum”) side, A! also simulates A^ but we can see 
that it must touch x — 1 (x + 1) as well. Again, instead of deleting x. A! makes x as a root. For any time, 
the cost of A! is at most the cost of A plus one. 

(hi) Splitting is equivalent to deleting when applied to the minimum or maximum element. Thus, if A 
satisfies split conjecture, then A satisfies deque conjecture without inserts. We can then use the argument 
of(ii). I 
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Figure 11: A Greedy execution with initial tree. Access points and initial tree are shown with black circles, points 
touched by Greedy are gray circles. Black rectangle marks a region in which all permutations are 
contained, but there is no access point inside. Gray rectangle marks initial tree. 
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Figure 12: From left to right: (i) “perturbed grid” construction when € = 5 with Greedy execution, (ii) example 
with low dynamic finger bound, but high pattern avoidance parameter, (iii) example with low pattern 
avoidance parameter and high dynamic finger bound, with Greedy execution, (iv) non-decomposable 
permutation with linear cost Greedy execution. 
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